(54) Title: REPRODUCTION-SPECIFIC GENES

(57) Abstract: Reproduction-specific nucleic acid molecules, particularly those that are indicative of or associated with infertility
in men, proteins encoded by these reproduction-specific nucleic acid molecules and antibodies that bind such proteins are described.
Also described are variant reproduction-specific genes and proteins, and antibodies which bind such proteins, as well as methods of
using the reproduction-specific genes, proteins and antibodies and methods of using the variant reproduction-specific genes, proteins
and antibodies.
REPRODUCTION-SPECIFIC GENES

RELATED APPLICATION 

This application claims the benefit of U.S. provisional application Serial No. 
60/187,518, filed on March 7, 2000, and U.S. provisional application Serial No. 
60/261,557, filed on January 12, 2001. The entire teachings of the above
applications are incorporated herein by reference. 

BACKGROUND OF THE INVENTION 

Infertility is of great clinical significance, and between 2 and 7% of couples
are infertile. Both physical and genetic factors are associated with male infertility.
Some genetic factors are chromosomal aberrations, including chromosomal
translocations, Down's syndrome, Klinefelter's syndrome and Y chromosome
microdeletions. Many cases of azoospermia are idiopathic (have no obvious cause)
in that the subject is infertile but otherwise healthy. Previous research has suggested
that genetic factors are important contributors to these cases, but these factors have
not been identified.

SUMMARY OF THE INVENTION 

Spermatogonial stem cells are designated as undifferentiated spermatogonia;
they are capable of self-renewal and persist as a constant population in adults. While
renewing themselves, some of these stem cells begin to differentiate to give rise to
type A spermatogonia. Type A spermatogonia divide four times and differentiate to
eventually become type B spermatogonia. Type B spermatogonia divide once, enter
meiosis at puberty, and eventually become mature sperm.

Described herein are novel nucleic acid molecules, referred to as
reproduction-specific nucleic acid molecules, from spermatogonia (the stem cells of
male germ cells); novel reproduction-specific proteins; antibodies that bind the
proteins; and uses of the nucleic acid molecules or portions thereof, proteins and
antibodies. The novel nucleic acid molecules of the present invention fall into three
classes: 1) male germ cell-specific nucleic acid molecules, which are nucleic acid
molecules that are expressed only in male germ cells; 2) testis-specific nucleic acid
molecules, which are nucleic acid molecules that are expressed only in testis; and 3)
testis- and ovary-specific nucleic acid molecules, which are nucleic acid molecules
that are expressed only in testis and ovary. As further described herein, the present
work has resulted in identification of a number of variants of the testis-specific
genes TAF2Q and TEX11, which are located on the X chromosome.

The present invention also relates to variant forms of reproduction-specific
nucleic acid molecules (referred to as variant reproduction-specific nucleic acid
molecules) that are indicative of or associated with infertility in men, proteins
encoded by variant reproduction-specific nucleic acid molecules (referred to as
variant reproduction-specific proteins), antibodies that bind such proteins, and
methods of using the variant reproduction-specific nucleic acid molecules or
portions thereof, proteins encoded by variant reproduction-specific nucleic acid
molecules, and antibodies that bind variant reproduction-specific proteins.

The present invention encompasses all of these nucleic acid molecules, their
complements, portions of the nucleic acid molecules and their complements, and
any nucleic acid molecules that, through the degeneracy of the genetic code, encode
a protein whose sequence is presented herein or a protein encoded by nucleic acid
molecules whose sequence is specifically presented herein. Nucleic acid molecules
of the present invention (genes, genomic sequences, cDNAs and portions of the
foregoing) are useful, for example, as hybridization probes and as primers for
amplification methods which, in turn, are useful in methods of detecting the
presence, absence or alteration of the nucleic acid molecules described herein.
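
The degeneracy of the genetic code mentioned above can be illustrated with a minimal Python sketch. It uses the standard genetic code; the two short coding sequences are invented for illustration only and are not taken from the sequence listing, and the function name `translate` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a coding DNA sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# Two different (hypothetical) nucleotide sequences encoding the same peptide.
seq_a = "ATGCTGAGCCGT"
seq_b = "ATGTTATCTAGA"
print(translate(seq_a), translate(seq_b))
```

Both toy sequences differ at several nucleotide positions yet translate to the same four-residue peptide, which is the sense in which distinct nucleic acid molecules can encode one protein.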

The present invention also relates to methods of identifying or determining 
differences in one or more of these reproduction-specific nucleic acid molecules that 
are associated with (indicative of) infertility in men. For example, nucleic acid
molecules from tissues or body fluids, such as nucleic acid molecules in blood,
obtained from one or more males with a known condition, such as lack of sperm 



WO 01/66752 (PCT/US01/07371)



production or reduced sperm count, can be assessed, using the nucleic acid
molecule(s) described herein, or characteristic portions thereof, to determine
whether the male(s) lacks some or all of the nucleic acid molecule(s) described
herein or has a variant nucleic acid molecule(s) (e.g., in which there is a deletion,
substitution, addition or mutation, compared to the sequences presented herein).
Nucleic acid molecules (e.g., from a male with reduced sperm count or viability) can
be assessed, using nucleic acid molecules described herein or nucleic acid molecules
which hybridize to a nucleic acid molecule described herein, to determine whether
they are associated with or causative for infertility (e.g., reduced sperm count or
viability). For example, the presence or absence of all or a portion of a nucleic acid
molecule or nucleic acid molecules shown to be necessary for fertility or adequate
sperm count can be assessed, using nucleic acid molecules which hybridize to the
nucleic acid molecule or nucleic acid molecules of interest, to determine the basis
for an individual's infertility or reduced sperm count. In one embodiment, the
occurrence of one or more reproduction-specific nucleic acid molecules or a
characteristic portion of one or more reproduction-specific nucleic acid molecules is
assessed in a sample containing nucleic acid molecules.

In another embodiment, deletion or alteration of one of the nucleic acid
molecules described herein or a characteristic portion thereof is used to assess a
nucleic acid sample obtained from a male who has a reduced sperm count or
spermatogenic failure. Lack of hybridization of reproduction-specific nucleic acid
molecules known to be present in fertile men, but not in infertile men, to nucleic
acid molecules in the sample (sample nucleic acid molecules) indicates that the gene
is not present in the sample nucleic acid molecules or is present in a variant form
which does not hybridize to reproduction-specific nucleic acid molecules present in
fertile men. In the present methods, sample nucleic acid molecules can be analyzed
for the alteration or occurrence of one or more of the reproduction-specific nucleic
acid molecules and can be analyzed for one or more of the three classes of nucleic
acid molecules described herein. For example, a group of nucleic acid molecule
probes (sequences) can be used to analyze sample nucleic acid molecules; the set of
probes can include nucleic acid molecule probes which hybridize to two or more
reproduction-specific nucleic acid molecules or nucleic acid molecule probes which
hybridize only to variant nucleic acid molecules characteristic of (indicative of)
infertility in men.

Nucleic acid molecules described herein are also useful as primers in an
amplification method, such as PCR, useful for identifying and amplifying
reproduction-specific nucleic acid molecules in a sample (e.g., blood). Further,
proteins or peptides encoded by a reproduction-specific nucleic acid molecule can be
assessed in samples. This can be carried out, for example, using antibodies which
recognize proteins or peptides of the present invention (proteins or peptides encoded
by nucleic acid molecules described herein or a variant thereof that is present in
infertile men, but not in fertile men or vice versa).

The present invention also relates to methods of diagnosing or aiding in the
diagnosis of infertility in men, based on differences present in at least one of these
nucleic acid molecules (between infertile men and fertile men). For example, one
embodiment of this invention is a diagnostic method, such as a method of
determining whether nucleic acid molecules from a man (e.g., obtained from blood
or other tissue) contain at least one nucleic acid molecule which varies (comprises a
substitution, deletion, addition or rearrangement) from reproduction-specific nucleic
acid molecules in a manner shown to be indicative of or characteristic of infertility.

The present invention further relates to proteins disclosed herein or encoded
by nucleic acid molecules described herein, portions of the proteins (such as
characteristic portions, referred to as characteristic peptides, useful in distinguishing
between infertile and fertile men) and antibodies (monoclonal or polyclonal) that
bind proteins of the present invention or characteristic portions thereof. The
proteins of the present invention include proteins encoded by nucleic acid molecules
whose sequence is disclosed herein; proteins whose amino acid sequences are
disclosed herein; and proteins whose amino acid sequence differs from the amino
acid sequence of proteins disclosed herein by at least one (one or more) residue and
are associated with or indicative of azoospermia (lack of or reduction in sperm
production), referred to as variant reproduction-specific proteins. Antibodies of the
present invention are useful in methods of diagnosing or aiding in the diagnosis of
infertility in men.

A further subject of the present invention is a method of contraception in
which sperm production and/or function are altered, preferably reversibly. In the
method, the function of one or more of the nucleic acid molecules or one or more of
the proteins described herein is disrupted in a man, with the result that sperm
production does not occur; occurs only to a limited extent (an extent less than
normally occurs in the individual); or is otherwise altered (e.g., defective sperm,
such as sperm with decreased motility or shortened lifespan, are produced). For
example, a reproduction-specific gene shown to be present in fertile men, but not in
infertile men, is targeted and its function (expression) is disrupted, with the result
that the gene is not expressed, is expressed at a reduced level (at a level lower than if
the gene function had not been disrupted) or, when it is expressed, the resulting
product is defective. Alternatively, a protein or proteins encoded by a
reproduction-specific gene(s) is targeted and its function is disrupted and/or the protein is
broken down (e.g., by proteolysis). Agents (drugs) useful in the method are also the
subject of the present invention.

Further, the present invention relates to a method of treating reduced sperm
count, reduced sperm function, reduced sperm motility or spermatogenic failure. In
one embodiment, a reduced sperm count is increased by administering an agent that
enhances the activity of a reproduction-specific gene or genes. Preferably, such
drugs target (act essentially exclusively upon) a reproduction-specific gene or
portion thereof. Such drugs can be administered by a variety of routes, such as oral
or intravenous administration. In another embodiment, a gene therapy method is
used. For example, one or more nucleic acid molecule(s) described herein, or a
portion thereof which encodes a functional protein, is introduced into a man whose
sperm count is reduced and in whom the nucleic acid molecule is expressed, and the
resulting protein replaces or supplements the protein normally produced or enhances
the quantity produced.




The nucleic acid molecules, proteins and antibodies that bind proteins of the
present invention, or portions thereof, are also useful as markers for spermatogonial
cells.

As described herein, particular variants of the testis-specific X-linked
TAF2Q and TEX11 nucleic acid molecules from infertile men were identified by
methods described herein. These variants result from alteration in the nucleic acid
molecule; some nucleic acid molecule alterations are silent (do not result in a
change in amino acid), while others result in an amino acid alteration or in
truncation of the encoded protein. These variants are associated with male
infertility. The particular variants are useful in the methods described herein and are
shown in Figures 107, 108, 111 and 112.

Thus, the invention relates to an isolated reproduction-specific nucleic acid
molecule comprising a nucleic acid molecule having a nucleotide sequence selected
from the group consisting of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23,
25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 50, 52, 54, 55, 56, 58, 59, 61, 62,
64, 66, 67, 69, 71, 73, 74, 75, 77, 79, 81, 83, 86, 87 and 89; and the complements
thereof.

The invention also relates to an isolated reproduction-specific nucleic acid
molecule comprising a portion of a nucleic acid molecule having a nucleotide
sequence selected from the group consisting of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13,
15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 50, 52, 54, 55,
56, 58, 59, 61, 62, 64, 66, 67, 69, 71, 73, 74, 75, 77, 79, 81, 83, 86, 87 and 89; and
the complements thereof, wherein said portion is at least 14 contiguous nucleotides
in length.
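
As a rough illustration of the "at least 14 contiguous nucleotides" criterion, the following Python sketch checks whether a candidate molecule contains a 14-nucleotide stretch of a reference sequence or of its complement. It is a sketch under stated assumptions: the function names are hypothetical, the complement is modeled as the reverse complement read 5' to 3', and the reference string is an invented toy sequence rather than any sequence from the listing.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def shares_contiguous_portion(candidate, reference, min_len=14):
    """True if candidate contains at least min_len contiguous nucleotides
    of the reference sequence or of its (reverse) complement."""
    candidate = candidate.upper()
    for target in (reference.upper(), reverse_complement(reference)):
        for i in range(len(target) - min_len + 1):
            if target[i:i + min_len] in candidate:
                return True
    return False


# Hypothetical toy reference; not a sequence from the sequence listing.
reference = "ATGGCCATTGTAATGGGCCGCTGAAAGGGTGCCCGATAG"
print(shares_contiguous_portion(reference[5:19], reference))   # 14-mer
print(shares_contiguous_portion(reference[5:18], reference))   # 13-mer
```

The threshold is exposed as `min_len` so that the same check can be run at other lengths.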

The invention further relates to an isolated reproduction-specific nucleic acid
molecule comprising a nucleic acid molecule which hybridizes under high
stringency hybridization conditions to a nucleic acid molecule having a nucleotide
sequence selected from the group consisting of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13,
15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 50, 52, 54, 55,
56, 58, 59, 61, 62, 64, 66, 67, 69, 71, 73, 74, 75, 77, 79, 81, 83, 86, 87 and 89; and
the complements thereof.

The invention also relates to an isolated reproduction-specific nucleic acid
molecule comprising a nucleic acid molecule having a nucleotide sequence which is
at least 70% identical to a nucleotide sequence selected from the group consisting of
SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39,
41, 43, 45, 47, 49, 50, 52, 54, 55, 56, 58, 59, 61, 62, 64, 66, 67, 69, 71, 73, 74, 75,
77, 79, 81, 83, 86, 87 and 89; and the complements thereof.

The invention further relates to an isolated reproduction-specific nucleic acid
molecule which encodes a protein having an amino acid sequence selected from the
group consisting of SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28,
30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 51, 53, 57, 60, 63, 65, 68, 70, 72, 76, 78, 80,
82, 84, 85, 88, and 90.

The invention further relates to an isolated variant reproduction-specific
nucleic acid molecule comprising a nucleic acid molecule having the nucleic acid
sequence of SEQ ID NO: 89 having one or more alterations selected from the group
consisting of A320G, T325A, C381T, G400A, A491G, G1282A, C1449A, T2219C,
A2250T, T2295C and T2472C. The invention also relates to an isolated variant
reproduction-specific nucleic acid molecule comprising a nucleic acid molecule
having the nucleic acid sequence of SEQ ID NO: 50 having one or more alterations
selected from the group consisting of the alterations shown in Figure 112.
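
A short Python sketch may help in reading alteration labels such as A320G. It assumes, as is conventional but not stated explicitly in the text, that the label gives the reference nucleotide, a 1-based position in the reference sequence, and the replacement nucleotide. The helper name `apply_substitution` and the toy sequence are hypothetical.

```python
import re


def apply_substitution(seq, alteration):
    """Apply a single-nucleotide substitution written like 'A320G'.

    Assumed reading: reference base, 1-based position, replacement base.
    Raises ValueError if the label is malformed, the position is out of
    range, or the reference base does not match the sequence.
    """
    match = re.fullmatch(r"([ACGT])(\d+)([ACGT])", alteration)
    if match is None:
        raise ValueError(f"unrecognized alteration label: {alteration!r}")
    ref, pos, alt = match.group(1), int(match.group(2)), match.group(3)
    if pos < 1 or pos > len(seq):
        raise ValueError(f"position {pos} outside sequence of length {len(seq)}")
    if seq[pos - 1] != ref:
        raise ValueError(f"expected {ref} at position {pos}, found {seq[pos - 1]}")
    return seq[:pos - 1] + alt + seq[pos:]


# Toy sequence for illustration only; not SEQ ID NO: 89.
toy = "ATGCA"
print(apply_substitution(toy, "C4T"))
```

Checking the reference base before substituting guards against applying a label to the wrong sequence or with the wrong numbering.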

The invention also relates to nucleic acid constructs comprising an isolated
reproduction-specific nucleic acid molecule according to the invention operably
linked to at least one regulatory sequence, and to a host cell comprising such nucleic
acid constructs.

The invention also relates to an isolated protein comprising an amino acid
sequence selected from the group consisting of SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14,
16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 51, 53, 57, 60, 63,
65, 68, 70, 72, 76, 78, 80, 82, 84, 85, 88, and 90. The invention also pertains to an
isolated protein comprising a portion of an amino acid sequence selected from the
group consisting of SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28,
30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 51, 53, 57, 60, 63, 65, 68, 70, 72, 76, 78, 80,
82, 84, 85, 88, and 90, wherein said portion is at least 7 contiguous amino acids.

The invention is also drawn to an isolated protein comprising the amino acid
sequence of SEQ ID NO: 90 having one or more alterations selected from the group
consisting of W109R, V134I, G164R, N483K and V740A. The invention also
relates to an isolated protein encoded by a nucleic acid molecule according to the
invention. The invention further relates to an antibody which specifically binds a
protein according to the invention.

The invention also relates to a method of diagnosing infertility associated
with alteration of a gene having a nucleotide sequence selected from the group
consisting of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31,
33, 35, 37, 39, 41, 43, 45, 47, 49, 50, 52, 54, 55, 56, 58, 59, 61, 62, 64, 66, 67, 69,
71, 73, 74, 75, 77, 79, 81, 83, 86, 87 and 89, and whose alteration is associated with
infertility, comprising the steps of: (a) obtaining a DNA sample to be assessed; (b)
processing the DNA sample such that the DNA is available for hybridization; (c)
combining the DNA of step (b) with nucleotide sequences complementary to the
altered nucleotide sequence of said gene, whose alteration is associated with
infertility, under conditions appropriate for hybridization of the probes with
complementary nucleotide sequences in the DNA sample, thereby producing a
combination; and (d) detecting hybridization in the combination, wherein presence
of hybridization in the combination is indicative of infertility associated with an
alteration of said gene.

The invention also relates to a method of diagnosing infertility associated
with alteration of a gene having a nucleotide sequence selected from the group
consisting of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31,
33, 35, 37, 39, 41, 43, 45, 47, 49, 50, 52, 54, 55, 56, 58, 59, 61, 62, 64, 66, 67, 69,
71, 73, 74, 75, 77, 79, 81, 83, 86, 87 and 89, and whose alteration is associated with
infertility, comprising the steps of: (a) obtaining a DNA sample to be assessed; (b)
processing the DNA sample such that the DNA is available for hybridization; (c)
combining the DNA of step (b) with nucleotide sequences complementary to the
nucleotide sequence of said gene, whose alteration is associated with infertility,
under conditions appropriate for hybridization of the probes with complementary
nucleotide sequences in the DNA sample, thereby producing a combination; and (d)
detecting hybridization in the combination, wherein absence of hybridization in the
combination is indicative of infertility associated with an alteration of said gene. In
a preferred embodiment, infertility is a result of reduced sperm count, reduced sperm
motility, malformed sperm, or combinations thereof.
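
The two diagnostic readouts described above differ only in which sequence the probe is complementary to and in whether presence or absence of hybridization is informative. The Python sketch below is a deliberately simplified in-silico model of that logic: an exact reverse-complement match stands in for hybridization (real hybridization tolerates some mismatch depending on stringency), and all function names and sequences are hypothetical.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def hybridizes(probe, sample):
    """Toy model: the probe 'hybridizes' if its reverse complement occurs
    exactly in the sample strand. Mismatch tolerance is ignored."""
    return reverse_complement(probe) in sample.upper()


def indicates_alteration(probe_targets_altered, hybridized):
    """Probe complementary to the altered sequence: presence of
    hybridization is the positive readout. Probe complementary to the
    unaltered gene: absence of hybridization is the positive readout."""
    return hybridized if probe_targets_altered else not hybridized


# Hypothetical toy sequences differing by one substitution.
normal = "ATGGCTAGCTTGACC"
altered = "ATGGCTAGGTTGACC"
probe_altered = reverse_complement(altered[3:12])
probe_normal = reverse_complement(normal[3:12])

sample = altered  # pretend this sample carries the alteration
print(indicates_alteration(True, hybridizes(probe_altered, sample)))
print(indicates_alteration(False, hybridizes(probe_normal, sample)))
```

In this toy model both probe designs flag the altered sample and neither flags the unaltered one.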

BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 shows the Spg1 cDNA sequence.
Figure 2 shows the Spg1 encoded protein sequence.
Figures 3a-3c show the Spg2 cDNA sequence.
Figure 4 shows the Spg2 encoded protein sequence.
Figures 5a-5b show the Spg3 cDNA sequence.
Figure 6 shows the Spg3 encoded protein sequence.
Figures 7a-7d show the Spg5 cDNA sequence.
Figures 8a-8b show the Spg5 encoded protein sequence.
Figures 9a-9b show the Spg13 cDNA sequence.
Figure 10 shows the Spg13 encoded protein sequence.
Figures 11a-11b show the Spg14 cDNA sequence.
Figures 12a-12b show the Spg14 encoded protein sequence.
Figures 13a-13b show the Spg15 cDNA sequence.
Figures 14a-14b show the Spg15 encoded protein sequence.
Figures 15a-15b show the Spg16 cDNA sequence.
Figure 16 shows the Spg16 encoded protein sequence.
Figures 17a-17b show the Spg17 cDNA sequence.
Figure 18 shows the Spg17 encoded protein sequence.
Figure 19 shows the Spg18 cDNA sequence.
Figure 20 shows the Spg18 encoded protein sequence.
Figures 21a-21b show the Spg25 cDNA sequence.
Figures 22a-22b show the Spg25 encoded protein sequence.
Figure 23 shows the Spg27 cDNA sequence.
Figure 24 shows the Spg27 encoded protein sequence.
Figures 25a-25b show the Spg33 cDNA sequence.
Figure 26 shows the Spg33 encoded protein sequence.
Figure 27 shows the Spg34 cDNA sequence.
Figure 28 shows the Spg34 encoded protein sequence.
Figures 29a-29b show the Spg39 cDNA sequence.
Figure 30 shows the Spg39 encoded protein sequence.
Figures 31a-31b show the Spg46 cDNA sequence.
Figures 32a-32b show the Spg46 encoded protein sequence.
Figures 33a-33b show the Spg58 cDNA sequence.
Figures 34a-34b show the Spg58 encoded protein sequence.
Figure 35 shows the Spg59 cDNA sequence.
Figure 36 shows the Spg59 encoded protein sequence.
Figures 37a-37b show the Spg64 cDNA sequence.
Figure 38 shows the Spg64 encoded protein sequence.
Figures 39a-39b show the Spg65 cDNA sequence.
Figure 40 shows the Spg65 encoded protein sequence.
Figures 41a-41b show the Spg69 cDNA sequence.
Figure 42 shows the Spg69 encoded protein sequence.
Figures 43a-43b show the Spg70 cDNA sequence.
Figure 44 shows the Spg70 encoded protein sequence.
Figures 45a-45c show the Spg85 cDNA sequence.
Figure 46 shows the Spg85 encoded protein sequence.
Figures 47a-47b show the Spg87 cDNA sequence.
Figure 48 shows the Spg87 encoded protein sequence.
Figure 49 shows the Spg84 cDNA sequence.
Figure 50 shows the hSPG1 cDNA sequence.
Figure 51 shows the hSPG1 encoded protein sequence.
Figures 52a-52b show the hSPG3a cDNA sequence.
Figure 53 shows the hSPG3a encoded protein sequence.
Figures 54a-54e show the hSPG3a genomic DNA sequence.
Figure 55 shows the hSPG3b cDNA sequence.
Figures 56a-56d show the hSPG5 cDNA sequence.
Figures 57a-57b show the hSPG5 encoded protein sequence.
Figures 58a-58e show the hSPG5 genomic DNA sequence.
Figures 59a-59c show the hSPG15 cDNA sequence.
Figure 60 shows the hSPG15 encoded protein sequence.
Figures 61a-61t show the hSPG15 genomic DNA sequence.
Figure 62 shows the hSPG18 cDNA sequence.
Figures 63a-63b show the hSPG18 encoded protein sequence.
Figures 64a-64b show the hSPG25 cDNA sequence.
Figure 65 shows the hSPG25 encoded protein sequence.
Figure 66 shows the hSPG27 cDNA sequence.
Figures 67a-67b show the hSPG34a cDNA sequence.
Figure 68 shows the hSPG34a encoded protein sequence.
Figure 69 shows the hSPG34b cDNA sequence.
Figure 70 shows the hSPG34b encoded protein sequence.
Figures 71a-71b show the hSPG39a cDNA sequence.
Figure 72 shows the hSPG39a encoded protein sequence.
Figures 73a and 73b show the hSPG39a genomic DNA sequence.
Figure 74 shows the hSPG39b cDNA sequence.
Figures 75a-75b show the hSPG46 cDNA sequence.
Figures 76a-76b show the hSPG46 encoded protein sequence.
Figure 77 shows the hSPG64 cDNA sequence.
Figures 78a-78b show the hSPG64 encoded protein sequence.
Figures 79a-79b show the hSPG85 cDNA sequence.
Figure 80 shows the hSPG85 encoded protein sequence.
Figures 81a-81b show the hSPG13 cDNA long form sequence.
Figure 82 shows the sequence of the protein encoded by hSPG13 long form.
Figures 83a-83b show the hSPG13 cDNA short form sequence.
Figure 84 shows the sequence of the protein encoded by hSPG13 short form.
Figure 85 shows the hSPG39b encoded protein sequence.
Figures 86a-86b show the hSPG39b genomic DNA sequence.
Figures 87a-87b show the hSPG70 cDNA sequence.
Figure 88 shows the hSPG70 encoded protein sequence.
Figures 89a and 89b show the nucleic acid sequence of TEX11 (SEQ ID NO: 89).
Figure 90 shows the amino acid sequence of TEX11 (SEQ ID NO: 90).
Figure 91 depicts the identification of spermatogonia-specific genes by cDNA subtraction.
Figure 92 depicts the known germ cell-specific genes enriched by subtraction.
Figure 93 depicts the genes identified by the subtraction.
Figure 94 depicts the novel mouse germ cell specific genes identified by subtraction.
Figure 95 depicts the post-transcriptional gene regulation of germ cell development.
Figure 96 depicts the abundance of male germ-cell-specific genes on X Chromosome.
Figure 97 depicts the rapid evolution of spermatogonia-specific genes in mouse and human.
Figure 98 depicts hybrid male sterility in mice.
Figure 99 depicts candidate genes for Hst-3.
Figure 100 depicts the 14 novel human testis-specific genes.
Figure 101 depicts the BAC physical map and gene structure of TEX11.
Figure 102 depicts the high throughput mutation screening by genomic sequencing.
Figure 103 depicts the mutations found in infertile but not fertile males.
Figure 104 depicts the clustering of mutations in 3' but not 5' regions of introns of TEX11.
Figure 105 depicts the epigenetic down regulation of X-linked genes during male meiosis.
Figure 106 depicts the abundance of spermatogonia genes on the X Chromosomes.
Figure 107 depicts the intronic variants in TEX11.
Figure 108 depicts the coding variants in TEX11.
Figure 109 is a pedigree chart of WHT3759 depicting infertility as a result of mutations in TEX11.
Figure 110 depicts the coding variants found in infertile but not fertile males.
Figure 111 is a pedigree chart of WHT2508 depicting a mutation in TAF2Q resulting in infertility.
Figure 112 depicts the variants in TAF2Q.
Figures 113a, 113b and 113c depict the twenty-three spermatogonially
expressed, germ cell specific genes in mouse and their human orthologs.

DETAILED DESCRIPTION OF THE INVENTION 

A description of preferred embodiments of the invention follows.
Described herein are isolated reproduction-specific nucleic acid molecules
which are male germ cell-specific, testis-specific or testis- and ovary-specific. Also
described are portions of the reproduction-specific nucleic acid molecules;
complements of the reproduction-specific nucleic acid molecules and portions
thereof; and nucleic acid molecules which hybridize to any of the
reproduction-specific nucleic acid molecules under conditions of high stringency. Also described
are nucleic acid molecules which are at least 70% identical in sequence to a
reproduction-specific nucleic acid molecule whose sequence is presented herein or
to a nucleic acid molecule which encodes a reproduction-specific protein whose
amino acid sequence is presented herein, or to a nucleic acid molecule which
hybridizes to any of the reproduction-specific nucleic acid molecules under
conditions of high stringency.

Particularly preferred are nucleic acid molecules and portions thereof
which have at least about 60%, preferably at least about 70%, 80% or 85%, more
preferably at least about 90%, even more preferably at least about 95%, and most
preferably at least about 98% identity with nucleic acid molecules described herein.
In one embodiment, the nucleic acid molecules hybridize under high
stringency hybridization conditions (e.g., for selective hybridization) to a nucleotide
sequence described herein.

Stringent hybridization conditions for nucleic acid molecules are well known
to those skilled in the art and can be found in standard texts such as Current
Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1998), pp. 2.10.1-
2.10.16 and 6.3.1-6.3.6, the teachings of which are hereby incorporated by reference.
As understood by those of ordinary skill, the exact conditions can be determined
empirically and depend on ionic strength, temperature and the concentration of
destabilizing agents such as formamide or denaturing agents such as SDS. Other
factors considered in determining the desired hybridization conditions include the
length of the nucleic acid sequences, base composition, percent mismatch between
the hybridizing sequences and the frequency of occurrence of subsets of the
sequences within other non-identical sequences. In one non-limiting example,
nucleic acid molecules are allowed to hybridize in 6X sodium chloride/sodium
citrate (SSC) at about 45°C, followed by one or more low stringency washes in 0.2X
SSC/0.1% SDS at room temperature, or by one or more moderate stringency washes
in 0.2X SSC/0.1% SDS at 42°C, or washed in 0.2X SSC/0.1% SDS at 65°C for high
stringency. Thus, equivalent conditions can be determined by varying one or more
of these parameters while maintaining a similar degree of identity or similarity
between the two nucleic acid molecules. Typically, conditions are used such that
sequences at least about 60%, at least about 70%, at least about 80%, at least about
90% or at least about 95% or more identical to each other remain hybridized to one
another.

The percent identity of two nucleotide or amino acid sequences can be
determined by aligning the sequences for optimal comparison purposes (e.g., gaps
can be introduced in the sequence of a first sequence). The nucleotides or amino
acids at corresponding positions are then compared, and the percent identity between
the two sequences is a function of the number of identical positions shared by the
sequences (i.e., % identity = (# of identical positions/total # of positions) x 100). In
certain embodiments, the length of a sequence aligned for comparison purposes is at
least 30%, preferably at least 40%, more preferably at least 60%, and even more
preferably at least 70%, 80% or 90% of the length of the reference sequence. The
actual comparison of the two sequences can be accomplished by well-known
methods, for example, using a mathematical algorithm. A non-limiting example of
such a mathematical algorithm is described in Karlin et al., Proc. Natl. Acad. Sci.
USA, 90:5873-5877 (1993). Such an algorithm is incorporated into the NBLAST
and XBLAST programs (version 2.0) as described in Altschul et al., Nucleic Acids
Res., 25:3389-3402 (1997). When utilizing BLAST and Gapped BLAST programs,
the default parameters of the respective programs (e.g., NBLAST) can be used. See
http://www.ncbi.nlm.nih.gov. In one embodiment, parameters for sequence
comparison can be set at score=100, wordlength=12, or can be varied (e.g., W=5 or
W=20).
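
The percent identity formula given above can be sketched in a few lines of Python. This is a minimal illustration that assumes the two sequences have already been aligned (gaps written as "-") and that "total # of positions" means the length of the alignment; the function name and the example alignment are hypothetical, and no alignment algorithm such as BLAST is implemented here.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity of two pre-aligned sequences of equal length.

    % identity = (# identical positions / total # positions) x 100,
    where total positions is taken to be the alignment length and a gap
    character '-' never counts as an identical position.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    identical = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )
    return 100.0 * identical / len(aligned_a)


# Toy alignment: 10 columns, one gap column and one mismatch column.
print(percent_identity("ACGT-ACGTA", "ACGTTACCTA"))
```

Other conventions for the denominator exist (for example, the length of the shorter sequence), so a reported percent identity should state which convention was used.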

Another mathematical algorithm utilized for the comparison of sequences is the
algorithm of Myers and Miller, CABIOS (1989). Such an algorithm is incorporated
into the ALIGN program (version 2.0) which is part of the CGC sequence alignment
software package. When utilizing the ALIGN program for comparing amino acid
sequences, a PAM120 weight residue table, a gap length penalty of 12, and a gap
penalty of 4 can be used. Additional algorithms for sequence analysis are known in
the art and include ADVANCE and ADAM as described in Torellis and Robotti
(1994) Comput. Appl. Biosci., 10:3-5; and FASTA described in Pearson and Lipman
(1988) PNAS, 85:2444-8.

The percent identity between two amino acid sequences can be determined
using the GAP program in the GCG software package (available at
http://www.cgc.com) using either a BLOSUM62 matrix or a PAM250 matrix, and a
gap weight of 12, 10, 8, 6, or 4 and a length weight of 2, 3, or 4. In yet another
embodiment, the percent identity between two nucleic acid sequences can be
determined using the GAP program in the GCG software package (available at
http://www.cgc.com), using a gap weight of 50 and a length weight of 3. Thus, a
substantially homologous amino acid or nucleotide sequence means an amino acid
or nucleotide sequence that is largely but not wholly homologous to a nucleic acid
molecule described herein, and which retains the same functional activity as the
molecule to which it is homologous.

Also described herein are variant reproduction-specific nucleic acid
molecules which are characteristic/indicative of infertility in men; mRNAs from
which the cDNA is transcribed (mRNAs that encode the cDNA); proteins encoded
by each of the nucleic acid molecules presented herein and by variations thereof
(nucleic acid molecules that, due to the degeneracy of the genetic code, encode an
amino acid sequence presented herein or a functional equivalent thereof); variant
proteins associated with or indicative of lack of or reduction in sperm count (variant
reproduction-specific proteins); characteristic portions of each of the proteins
described herein; and antibodies that bind reproduction-specific proteins or variant
reproduction-specific proteins or characteristic portions of these proteins.

The SEQ ID NO. for each of the sequences presented herein is shown in
Table 1. Where shown, lower case letters in the figures indicate untranslated regions
of the DNA. However, not all untranslated regions are shown in lower case letters.
The skilled artisan can determine the appropriate coding region for each cDNA
described herein using methods (e.g., computer programs) that are routine in the art.

Table 1. List of Sequence ID Numbers for cDNA, Protein and Genomic Sequences

SEQ ID NO.  Gene Name     Gene Symbol  Sequence  GenBank NO.
1           Spg1          -            cDNA      [illegible]
2           Spg1          Taf2q        Protein   -
3           Spg2          Tex11        -         AF285572
4           Spg2          Tex11        Protein   AF285572
5           Spg3          Nxf2         cDNA      AF285575
6           Spg3          Nxf2         Protein   AF285575
7           Spg5          Tex15        cDNA      AF285589
8           Spg5          Tex15        Protein   AF285589
9           Spg13         Rnf17        cDNA      AF285585
10          Spg13         Rnf17        Protein   -
11          -             Scmh2        cDNA      AF285577
12          -             Scmh2        Protein   AF285577
13          -             Mov10l1      cDNA      AF285587
14          -             Mov10l1      Protein   AF285587
15          Spg16         Piwil2       cDNA      AF285586
16          Spg16         Piwil2       Protein   AF285586
17          -             Tktl1        cDNA      AF285571
18          Spg17         Tktl1        Protein   AF285571
19          Spg18         Tex12        cDNA      AF285582
20          Spg18         Tex12        Protein   AF285582
21          Spg25         Usp26        cDNA      AF285570
22          Spg25         Usp26        Protein   AF285570
23          -             -            cDNA      -
24          Spg27         -            Protein   -
25          Spg33         Tex19        cDNA      AF285590
26          Spg33         Tex19        Protein   AF285590
27          Spg34         Fthl17       cDNA      AF285569
28          Spg34         Fthl17       Protein   AF285569
29          Spg39         Tex13        cDNA      AF285576
30          Spg39         Tex13        Protein   AF285576
31          Spg46         Stk31        cDNA      AF285580
32          Spg46         Stk31        Protein   AF285580
33          Spg58         Tex16        cDNA      AF285573
34          Spg58         Tex16        Protein   AF285573
35          Spg59         Tex20        cDNA      AF285588
36          Spg59         Tex20        Protein   AF285588
37          Spg64         -            cDNA      -
38          Spg64         -            Protein   -
39          Spg65         Rnh2         cDNA      AF285581
40          Spg65         Rnh2         Protein   AF285581
41          Spg69         Pramel1      cDNA      AF285578
42          Spg69         Pramel1      Protein   AF285578
43          Spg70         Tdrd1        cDNA      AF285591
44          Spg70         Tdrd1        Protein   AF285591
45          Spg85         Tex14        cDNA      AF285584
46          Spg85         Tex14        Protein   AF285584
47          Spg87         Tex18        cDNA      AF285583
48          Spg87         Tex18        Protein   AF285583
49          Spg84         Tex17        cDNA      AF285579
50          hSPG1         TAF2Q        cDNA      AF285595
51          hSPG1         TAF2Q        Protein   AF285595
52          hSPG3a        NXF2         cDNA      AF285596
53          hSPG3a        NXF2         Protein   AF285596
54          hSPG3a        -            Genomic   -
55          hSPG3b        -            cDNA      -
56          hSPG5         TEX15        cDNA      AF285605
57          hSPG5         TEX15        Protein   AF285605
58          hSPG5         -            Genomic   -
59          hSPG15        MOV10L1      cDNA      AF285604
60          hSPG15        MOV10L1      Protein   AF285604
61          hSPG15        -            Genomic   -
62          hSPG18        TEX12        cDNA      AF285600
63          hSPG18        TEX12        Protein   AF285600
64          hSPG25        USP26        cDNA      AF285593
65          hSPG25        USP26        Protein   AF285593
66          hSPG27        -            cDNA      -
67          hSPG34a       -            cDNA      -
68          hSPG34a       -            Protein   -
69          hSPG34b       FTHL17       cDNA      AF285592
70          hSPG34b       FTHL17       Protein   AF285592
71          hSPG39a       TEX13A       cDNA      AF285597
72          hSPG39a       TEX13A       Protein   AF285597
73          hSPG39a       -            Genomic   -
74          hSPG39b       TEX13B       cDNA      AF285598
75          hSPG46        -            cDNA      AF285599
76          hSPG46        STK31        Protein   AF285599
77          hSPG64        -            cDNA      -
78          hSPG64        -            Protein   -
79          hSPG85        TEX14        cDNA      AF285601
80          hSPG85        TEX14        Protein   AF285601
81          hSPG13 long   RNF17        cDNA      AF285602
82          hSPG13 long   RNF17        Protein   AF285602
83          hSPG13 short  RNF17        cDNA      AF285603
84          hSPG13 short  RNF17        Protein   AF285603
85          hSPG39b       TEX13B       Protein   AF285598
86          hSPG39b       -            Genomic   -
87          hSPG70        TDRD1        cDNA      AF285606
88          hSPG70        TDRD1        Protein   AF285606
89          hSPG2         TEX11        cDNA      AF285594
90          hSPG2         TEX11        Protein   AF285594

As used herein, the terms "reproduction-specific nucleic acid molecules" and
"reproduction-specific genes" refer, respectively, to reproduction-specific nucleic
acid molecules and reproduction-specific genes which are male germ cell-specific,
testis-specific or testis- and ovary-specific. As used herein, the terms "variant
reproduction-specific nucleic acid molecules" and "variant reproduction-specific
genes" refer, respectively, to reproduction-specific nucleic acid molecules and
reproduction-specific genes which are male germ cell-specific, testis-specific or
testis- and ovary-specific. Variant reproduction-specific nucleic acid molecules or
genes can differ from reproduction-specific nucleic acid molecules in nucleic acid
sequence (e.g., deletion of one or more nucleotides, addition of one or more
nucleotides or substitution or change in one or more nucleotides) or by their "loss",
either physically or by failure of, or reduction in, expression.

As used herein, the term "isolated" refers to substances which are obtained
from (separated from) the sources in which they occur in nature, as well as to
substances (e.g., nucleic acid molecules, proteins, peptides) produced by
recombinant/genetic engineering methods or by synthetic (chemical) methods.

Also the subject of the present invention are methods in which the nucleic
acid molecules, proteins, and antibodies of the present invention are useful. Such
methods include a method of identifying genes or proteins characteristic of male
infertility, which include variant genes or proteins present in infertile men, but not in
fertile men, and nucleic acid molecules or proteins present at different levels or at a
different stage(s) in differentiation in infertile men than in fertile men. Also
included is a method of diagnosing or aiding in the diagnosis of infertility in men,
and a method of contraception in which sperm production or sperm count is reduced
(no sperm is produced, or sperm is produced to a lesser extent than normal in an
individual) or defective sperm is produced (e.g., sperm with reduced motility,
lifespan or testicular maturation arrest, or Sertoli cell defects). As used herein, the
terms "infertility in men" or "male infertility" include spermatogenic failure, a lack
of sperm production, a severely reduced sperm count and production of defective
sperm, each of which results in the inability or a severely reduced ability to cause
fertilization.

Tex11 is a reproduction-specific gene that is X chromosome-linked. Its 3 kb
cDNA encodes a 917-residue protein that has no homology with other known
proteins. The Tex11 gene is approximately 400 kb and consists of 29 exons. As
described in Example 2, 380 infertile males and 93 fertile males (fathers) were
studied and 33 mutations were found in the nucleic acid sequence of TEX11; of
these, 21 were found only in infertile males. These mutations include A320G,
T325A, C381T, G400A, A491G, G1282A, C1449A, T2219C, A2250T, T2295C and
T2472C. Also shown in Figure 108 is a two base pair insertion in exon 15 at
nucleotide position 1233 (denoted as ins(2bp)). A clustering of mutations is found in the
3' but not the 5' regions of the intron. These nucleic acid alterations are shown in
Figure 108.
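
Labels such as A320G can be handled programmatically. The following Python sketch assumes the conventional reading of such a label (reference base, 1-based nucleotide position, variant base); the text itself does not define the notation, so that reading, and the helper names used here, are illustrative assumptions only. The toy sequence in the usage note is not taken from any sequence described herein.

```python
import re
from typing import NamedTuple


class Substitution(NamedTuple):
    ref: str       # base expected in the reference sequence
    position: int  # 1-based nucleotide position (assumed convention)
    alt: str       # base observed in the variant


# One base, a run of digits, one base, e.g. "A320G".
_LABEL = re.compile(r"([ACGT])(\d+)([ACGT])")


def parse_substitution(label: str) -> Substitution:
    """Split a single-base substitution label into its three parts."""
    match = _LABEL.fullmatch(label.strip().upper())
    if match is None:
        raise ValueError(f"not a single-base substitution label: {label!r}")
    ref, pos, alt = match.groups()
    return Substitution(ref, int(pos), alt)


def apply_substitution(reference: str, sub: Substitution) -> str:
    """Return a copy of `reference` carrying the substitution.

    Raises ValueError if the position is out of range or the reference
    base at that position does not match the label.
    """
    index = sub.position - 1
    if not 0 <= index < len(reference):
        raise ValueError("position lies outside the reference sequence")
    if reference[index].upper() != sub.ref:
        raise ValueError("reference base does not match the label")
    return reference[:index] + sub.alt + reference[index + 1:]
```

As a usage example, apply_substitution("ACGT", parse_substitution("C2T")) replaces the second base of a four-base toy sequence. Checking the reference base before substituting guards against applying a label to the wrong coordinate system.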

Another X linked reproduction-specific gene identified as containing variants 
as described herein is TAF2Q. The TAF2Q DNA and amino acid variations 
associated with infertility are shown in Figure 112. 

Isolated nucleic acid molecules (genes, cDNAs,
mRNA, RNA) of the present invention are of mammalian origin, such as of mouse
(designated as Spg), human (designated as hSpg) or other primate, canine, feline or
bovine origin. 

Both reproduction-specific nucleic acid molecules and variant reproduction-specific
nucleic acid molecules are useful as hybridization probes or primers for an
amplification method, such as polymerase chain reaction, to show the presence,
absence or alteration of a gene(s) described herein. Probes and primers can
comprise all or a portion of the nucleotide sequence (nucleic acid sequence) of a
reproduction-specific nucleic acid molecule described herein or all or a portion of its
complement. They can also comprise all or a portion of a variant reproduction-specific
nucleic acid molecule which portion is characteristic of (indicative of)
infertility or all or a portion of its complement. The probes and primers can be of
any length, provided that they are of sufficient length and appropriate composition
(appropriate nucleotide sequence) to hybridize to all or an identifying or
characteristic portion of a gene indicative of infertility in men and remain hybridized
under the conditions used. Useful probes include nucleic acid molecules which
distinguish between a reproduction-specific nucleic acid molecule described herein
and a variant form of such a nucleic acid molecule that is indicative of infertility in
men. Generally, the probe will be at least 14 nucleotides; the upper limit is the
length of the nucleic acid molecule itself. Probes can be, for example, 14 to 20
nucleotides or longer (e.g., 15 to 25, 20 to 40, 30 to 50 or any other length
appropriate to specifically hybridize to a reproduction-specific gene or a variant
reproduction-specific nucleic acid molecule and remain hybridized to nucleic acid
molecules in a sample under the conditions used). The length of a specific probe
will also be determined by the method in which it is used.

The genes described herein are useful to detect variant reproduction-specific
nucleic acid molecules present in a nucleic acid molecule sample obtained from men
with lack of or reduction in sperm production, but not present in a nucleic acid
molecule sample obtained from fertile men. Variant reproduction-specific nucleic
acid molecules (e.g., having large alterations or deletions and small alterations or
deletions such as short deletions, point mutations and small insertions) can be
identified with reference to reproduction-specific nucleic acid molecules/gene
sequences presented herein. For example, nucleic acid molecules from infertile men
with normal karyotypes and no Y chromosome microdeletions can be assessed. All
human spermatogonic genes can be screened in a group of infertile men (with no or
low sperm counts) using PCR. One pair of PCR primers can be designed for each
spermatogonic gene to produce a 200 bp PCR product or a PCR product of any
appropriate length. A negative PCR result indicates the absence of a particular gene
in an individual and can be confirmed by Southern blot. Small variations can be
searched for in X-linked genes by nucleic acid molecule sequencing. Fertile men are
used as controls. If a variant reproduction-specific gene is identified, additional
infertile men can be similarly screened to further confirm that the variant
reproduction-specific nucleic acid molecule is associated with/indicative of
infertility in men. Alterations which are specific to infertile men can be used in the
diagnosis of male infertility, alone or in conjunction with other methods of assessing
male infertility.

The spermatogonic genes are strong candidates for pure male sterility factors.
A mutation in such a gene could alter its function in spermatogenesis and therefore
cause male infertility. These novel genes are promising for the following reasons:
first, they are germ cell-specific and expressed in spermatogonia. Two known germ
cell-specific Y-linked human genes, RBM and DAZ, are also expressed in
spermatogonia and are strongly implicated in male infertility when deleted. The
mouse homologues of RBM and DAZ were also identified in the subtraction
protocol described in the Examples, suggesting an important role for other
spermatogonic genes in male fertility. Second, nearly 50% of novel germ cell-specific
genes are located on Chromosome X. This is significant from a theoretical
point of view, indicating that Chromosome X may play the most important role in
male fertility. From a practical point of view, this result shows that mutations in
infertile men are more likely to be found in X-linked genes than in autosomal genes.
It is also far easier to search for mutations in X-linked genes because, in males,
there is only one copy of the X-linked gene. For example, to find a mutation with a
frequency of 1% in the population, one can screen 100 individuals if it is X-linked.
If the gene is autosomal, one has to screen 10,000 individuals (1% x 1% = 0.01%) to
find a homozygous mutation. However, the method described herein applies to the
search for variations in infertile men in both X-linked and autosomal genes of this
invention.
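
The screening arithmetic in the example above can be written out as a short Python sketch. It assumes, as the 1% x 1% calculation implicitly does, that the two autosomal copies are inherited independently, so that homozygotes occur at the square of the allele frequency, and it treats "individuals to screen" as the reciprocal of the fraction of men expected to show the mutation. The function name is hypothetical.

```python
def individuals_to_screen(allele_frequency: float, x_linked: bool) -> float:
    """Approximate number of men to screen to expect one affected individual.

    X-linked: a man has a single copy, so the affected fraction equals the
    allele frequency. Autosomal: both copies must carry the mutation, so the
    affected fraction is the allele frequency squared (independence assumed).
    """
    if not 0.0 < allele_frequency <= 1.0:
        raise ValueError("allele_frequency must be in the interval (0, 1]")
    affected_fraction = allele_frequency if x_linked else allele_frequency ** 2
    return 1.0 / affected_fraction
```

Called with an allele frequency of 0.01, the function gives roughly 100 for an X-linked gene and roughly 10,000 for an autosomal gene, matching the figures stated in the text.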

In a further embodiment, the present invention is a method of diagnosing
reduced (partially or totally) sperm count or infertility in a man. For example, a
method of diagnosing infertility in a man comprises (a) comparing the nucleic acid
sequence of reproduction-specific nucleic acid molecules obtained from a man in
whom infertility is to be assessed with the nucleic acid sequence of corresponding
variant reproduction-specific nucleic acid molecules from infertile men, wherein the
corresponding variant reproduction-specific nucleic acid molecules comprise an
alteration characteristic of infertility in men; and (b) determining whether the
alteration characteristic of infertility in men is present in the reproduction-specific
nucleic acid molecules obtained from the man in whom fertility is to be assessed. If
the alteration is present in the nucleic acid molecules obtained, infertility is
diagnosed in the man. A corresponding variant reproduction-specific nucleic acid
molecule is a reproduction-specific nucleic acid molecule of the same chromosomal
location as the chromosomal location of the nucleic acid molecule being analyzed (a
nucleic acid molecule obtained from a man being assessed). One or more of the
nucleic acid molecules described herein, or a portion(s) of one or more of the nucleic
acid molecules or nucleic acid molecules that hybridize to nucleic acid molecules
described herein or to a complement thereof can be used in a diagnostic method,
such as a method to determine whether a gene(s) or a portion of a gene(s) described
herein is missing or altered in men. Any man may be assessed with this method of
diagnosis. In general, the man will have been at least preliminarily assessed, by
another method, as having reduced sperm count. By combining nucleic acid probes
derived from a sequence presented herein that is present in the DNA of fertile men,
but not in the DNA of infertile men, with the nucleic acid molecules from a sample
to be assessed, under conditions suitable for hybridization of the probes with DNA
present in fertile men, but not with variant DNA, it can be determined whether the
sample from a man to be assessed comprises the variant reproduction-specific
nucleic acid molecules. If the nucleic acid molecule is unaltered (is not a variant
reproduction-specific nucleic acid molecule), it may be concluded that the
alteration of the gene is not responsible for the reduced sperm count. Alternatively,
the hybridization conditions used can be such that the probes will hybridize only
with variant reproduction-specific nucleic acid molecules and not with
reproduction-specific nucleic acid molecules.

Nucleic acid molecules assessed by the present method can be obtained from
a variety of tissues and body fluids, such as blood or semen. In one embodiment, the
above methods are carried out on nucleic acid molecules obtained from a blood
sample. For example, a nucleic acid sample from men who are infertile or have a
low sperm count is assessed to determine whether all or a portion of a nucleic acid
molecule(s) described herein differs in sequence from the sequence of a
corresponding nucleic acid molecule obtained from fertile men. In one embodiment,
the altered nucleic acid molecule or gene which is assessed is one which differs
from a sequence described herein by a deletion, addition or substitution of at least
one nucleotide. In a second embodiment, the altered nucleic acid molecule or gene
is "missing" in that it is physically absent or not expressed/under-expressed
(functionally absent). If an alteration occurs in a nucleic acid molecule obtained
from infertile men, but not fertile men, it is indicative of (characteristic of) infertility
and, thus, useful in the diagnosis of infertility in men. Such a nucleic acid molecule
or gene is referred to as a variant reproduction-specific nucleic acid molecule or
variant reproduction-specific gene.

This invention also relates to proteins encoded by the genes or portions of the
genes described herein, proteins encoded by variant nucleic acid molecules (or
portions thereof) that are characteristic of infertility in men, and
antibodies that recognize (bind) proteins described herein. Such antibodies are
useful in a diagnostic method to determine whether an intact or variant protein(s) is
present in a sample (e.g., semen or testis biopsy) obtained from a man being
assessed for infertility. They are also useful for identifying the expression of the
gene(s) in a particular cell type or at a particular developmental stage. These
antibodies can be used for studies of spermatogenesis. They can also be used
for immunofluorescence of germ cells, or in Western blots for assessing the presence
of the protein the antibody binds.

The invention also provides expression vectors containing a reproduction-specific
nucleic acid molecule of the present invention which is operably linked to at
least one regulatory sequence. "Operably linked" is intended to mean that the
nucleotide sequence is linked to a regulatory sequence in a manner which allows
expression of the nucleotide sequence. The term "regulatory sequence" includes
promoters, enhancers, and other expression control elements (see, e.g., Goeddel,
Gene Expression Technology: Methods in Enzymology 185, Academic Press, San
Diego, CA (1990)). It should be understood that the design of the expression vector
may depend on such factors as the choice of the host cell to be transformed and/or
the protein or peptide desired to be expressed. For instance, the proteins and
peptides of the present invention can be produced by ligating the cloned gene, or a
portion thereof, into a vector suitable for expression in either prokaryotic cells,
eukaryotic cells or both (see, for example, Broach, et al., Experimental Manipulation
of Gene Expression, ed. M. Inouye (Academic Press, 1993) p. 83; Molecular
Cloning: A Laboratory Manual, 2nd Ed., Sambrook et al. (Cold Spring Harbor
Laboratory Press, 1989) Chapters 16 and 17).

Prokaryotic and eukaryotic host cells transfected by the described vectors are 
also provided by this invention. For instance, cells which can be transfected with the 
vectors of the present invention include, but are not limited to, bacterial cells, such
as E. coli, insect cells (baculovirus), yeast and mammalian cells, such as Chinese
hamster ovary (CHO) cells. 

Thus, a nucleotide sequence described herein can be used to produce a
recombinant form of the encoded protein via microbial or eukaryotic cellular
processes. Production of a recombinant form of the protein can be carried out using
known techniques, such as by ligating the oligonucleotide sequence into a DNA or
RNA construct, such as an expression vector, and transforming or transfecting the
construct into host cells, either eukaryotic or
prokaryotic (bacterial cells). Similar procedures, or modifications thereof, can be
employed to prepare recombinant proteins according to the present invention by
microbial means or tissue-culture technology.

The present invention also pertains to pharmaceutical compositions
comprising the proteins and peptides described herein. For instance, the peptides or
proteins of the present invention can be formulated with a physiologically acceptable
medium to prepare a pharmaceutical composition. The particular physiological
medium may include, but is not limited to, water, buffered saline, polyols (e.g.,
glycerol, propylene glycol, liquid polyethylene glycol) and dextrose solutions. The
optimum concentration of the active ingredient(s) in the chosen medium can be
determined empirically, according to procedures well known in the art, and will
depend on the ultimate pharmaceutical formulation desired. Methods of
introduction of exogenous polypeptides at the site of treatment include, but are not
limited to, intradermal, intramuscular, intraperitoneal, intravenous, subcutaneous,
oral and intranasal methods. Other suitable methods of introduction can also include
rechargeable or biodegradable devices and slow release polymeric devices. The
pharmaceutical compositions of this invention can also be administered as part of a
combinatorial therapy with other agents.

This invention also has utility in methods of treating disorders of reduced
sperm count or enhancing/increasing sperm count and/or sperm activity. Reduced
sperm count can be increased, for example, by administering a drug or agent that
enhances the activity of a reproduction-specific gene or genes, with the result that
sperm count is enhanced. Alternatively it can be used in a method of gene therapy,
whereby the gene or a gene portion encoding a functional protein is inserted into
cells in which the functional protein is expressed and from which it is generally
secreted to remedy the deficiency caused by the defect in the native gene.

The invention described herein also has application to the area of male
contraceptives. Variant reproduction-specific genes indicative of infertility can be
used to design agents which mimic the activity of the altered gene product(s). Thus,
the present invention also relates to agents or drugs, such as, but not limited to,
peptides or small organic molecules which mimic the activity (effects) of the variant
gene product(s) of reproduction-specific genes (a variant reproduction-specific
protein) of the present invention shown to be present in infertile men, but not in
fertile men. One embodiment of this invention is a method of contraception (a
method of reducing sperm production and/or sperm activity) in a man, comprising
administering to the man an agent that mimics the effects of a variant
reproduction-specific protein in the man, whereby sperm production, sperm activity or both are
reduced (and preferably abolished) in the man.

Alternatively, the agent or drug is one which blocks or inhibits the
expression, activity or function of the reproduction-specific gene (e.g., an
oligonucleotide or a peptide which blocks or inhibits the expression, activity or
function of a reproduction-specific gene present in nucleic acid molecules of fertile
men). The ideal agent will enter the cell, in which it will block or inhibit the
function of the gene, directly or indirectly. Alternatively, an agent or drug can
inhibit the activity or function of one or more proteins encoded by
reproduction-specific nucleic acid molecules.

Reproduction-specific nucleic acid molecules described herein, such as those
that encode proteins which have enzymatic activity, are potential targets of such
blocking agents or inhibitors, as are the encoded proteins. For example, Spg17,
which encodes a transketolase, and its human homologue; Spg25, which encodes a
deubiquitinating enzyme, and its human homologue; Spg65, which encodes
an RNase inhibitor, and its human homologue; and Spg85, which encodes a tyrosine
protein kinase, and its human homologue can be targets of inhibitors, as can the
encoded proteins. Agents that inhibit the gene, directly or indirectly, and/or the
encoded product, directly or indirectly, are potential contraceptive agents.

Identification of a blocking agent or inhibitor of a reproduction-specific gene
or an encoded product can be carried out using known methods. For example, a
gene for which an inhibitor is to be identified can be expressed in an appropriate
host cell (e.g., mouse or human cell lines), in the presence of an agent or drug to be
assessed for its ability to block or inhibit a reproduction-specific gene(s) (a candidate
drug). The ability of the candidate drug to do so can be assessed in several ways.
For example, its effect on expression of the gene (e.g., by determining if the gene
product is present in the host cells, by immunoassay or Western blot) can be
assessed. Alternatively, binding of the candidate drug to the reproduction-specific
gene or to the encoded protein can be assessed, as can degradation or disruption of
the gene or the encoded protein. For example, hSPG25 has two catalytic domains
(Cys domain and His domain) that are conserved within the ubiquitin specific
protease family (Usp) members. In a bacterial assay (Baker et al., J Biol Chem 267,
23364-75 (1992)), the enzyme encoded by hSPG25 might cleave the Ub (ubiquitin)
moiety from the substrate Ub-Arg-β-Gal, a fusion protein of Ub and E. coli
β-galactosidase linked by an arginine. E. coli expressing Ub-Arg-β-Gal only will form
blue colonies in the presence of its chromogenic substrate X-Gal. A
deubiquitinating enzyme, like hSPG25, introduced in E. coli would cleave
Ub-Arg-β-Gal into Ub and Arg-β-Gal, which is an unstable protein, thus forming white
colonies. An effective candidate drug would block the deubiquitinating activity of hSPG25. E.
coli expressing both Ub-Arg-β-Gal and hSPG25 should therefore form blue colonies in the
presence of X-Gal and the candidate drug.
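
The expected read-out of the colony assay described above can be summarized as a small decision function. This Python sketch only restates the logic of the paragraph: it assumes every cell carries the Ub-Arg-β-Gal reporter and is plated on X-Gal, and it treats inhibition as all-or-nothing. The function and parameter names are hypothetical.

```python
def expected_colony_color(expresses_hspg25: bool, inhibitor_present: bool) -> str:
    """Predict colony color for E. coli carrying the Ub-Arg-beta-Gal reporter.

    The reporter is stable (blue on X-Gal) unless an active deubiquitinating
    enzyme removes the Ub moiety, which leaves an unstable Arg-beta-Gal
    product (white). An effective inhibitor keeps the enzyme inactive.
    """
    enzyme_active = expresses_hspg25 and not inhibitor_present
    return "white" if enzyme_active else "blue"
```

Under these assumptions, a candidate drug scores as a hit when colonies expressing hSPG25 turn from white (no drug) to blue (drug present), while reporter-only colonies stay blue in both conditions.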

The present invention also relates to antibodies that bind a protein or peptide
encoded by all or a portion of the reproduction-specific nucleic acid molecule, as
well as antibodies which bind the protein or peptide encoded by all or a portion of a
variant nucleic acid molecule. For instance, polyclonal and monoclonal antibodies
which bind to the described polypeptide or protein are within the scope of the
invention. In a specific embodiment, this invention relates to antibodies (polyclonal
or monoclonal) that bind a protein or peptide that is associated with or indicative of
infertility in men (a variant protein or peptide). Such antibodies can be used, alone
or in combination with antibodies that bind proteins or peptides encoded by
reproduction-specific nucleic acid molecules found in fertile men, in immunoassays
carried out to diagnose or aid in the diagnosis of infertility.

Antibodies of this invention can be produced using known methods. An
animal, such as a mouse, goat, chicken or rabbit, can be immunized with an
immunogenic form of the protein or peptide (an antigenic fragment of the protein or
peptide which is capable of eliciting an antibody response). Techniques for
conferring immunogenicity on a protein or peptide include conjugation to carriers or
other techniques well known in the art. The protein or peptide can be administered
in the presence of an adjuvant. The progress of immunization can be monitored by
detection of antibody titers in plasma or serum. Standard ELISA or other
immunoassays can be used with immunogen as antigen to assess the levels of
antibody. Following immunization, anti-peptide antisera can be obtained, and if
desired, polyclonal antibodies can be isolated from the serum. Monoclonal
antibodies can also be produced by standard techniques which are well known in the
art (Kohler and Milstein, Nature 256:495-497 (1975); Kozbor et al., Immunology
Today 4:72 (1983); and Cole et al., Monoclonal Antibodies and Cancer Therapy,
Alan R. Liss, Inc., pp. 77-96 (1985)). Such antibodies are useful as diagnostics for
the intact or disrupted gene, and also as research tools for identifying either the intact
or disrupted gene.

As described in Example 2, chromosomal mapping of the genes described 
herein demonstrated the surprisingly large number of genes on sex chromosome X. 
This is the strongest evidence to date in support of the population genetics theory
first suggested by R. A. Fisher and formalized by W. Rice. (Fisher, R. A., Biol. Rev.
6, 345-368 (1931); Rice, W., Evolution 38, 735-742 (1996); Hurst, L.D. and J.P.
Randerson, Trends Genet. 15, 383-385 (1999)). This theory argues that sexually
antagonistic traits (beneficial in one sex, but detrimental or neutral in the other) on
chromosome X tend to be strongly selected and, therefore, accumulate. Male germ
cell-specific genes are only expressed in males and are, therefore, sexually 
antagonistic genes. The work described herein has resulted in identification of a 
number of testis-specific genes on chromosome X in both mice and humans. 

In 1922, J.B.S. Haldane observed that when in the offspring of two different 
animal races one sex is absent, rare, or sterile, that sex is the heterozygous sex (XY 
or ZW) (Haldane, 1922); for example, males (XY) are sterile and females (XX) are 
fertile. This rule is obeyed in all animals: lepidoptera, birds, flies and mammals. 
The rule is significant because it bears on the early stage in speciation, known as 
the origin of species. Hypotheses proposed to explain Haldane's rule include the 
following: incompatibility between X- and Y-linked genes, meiotic 
drive, disruption of dosage compensation, X-autosome translocation, dominance 
theory, faster-male theory and faster-X theory. The two assumptions made are that 
there is an abundance of "speciation genes" on the X chromosome and that 
"speciation genes" evolve rapidly. The result of the male sterility is reproductive 
isolation and the origin of two species. 

Hybrid male sterility in mice has been mapped to the Hst-1 and Hst-3 loci (Forejt J. et 
al., Mammalian Genome 1:84-91 (1991); Matsuda Y. et al., Proc. Natl. Acad. Sci. 
USA 88:4850-4854 (1991)). In one study, in which M.m. musculus was crossed with 
M.m. domesticus, the male sterility mapped to the chromosome 17 t-complex (Hst-1 
locus) and resulted in meiotic arrest of the spermatogonia. The X-Y dissociation and 
autosomal dissociation are high and the nature of the defect is genetic. In the other 
study, in which M. spretus was crossed with M.m. domesticus, the male sterility mapped 
to the distal end of chromosome X and produced meiotic arrest of the spermatogonia. The 
X-Y dissociation is high/low, the autosomal dissociation high/low and the nature of the 
defect may be structural. 

The present invention is illustrated by the following Examples, which are not 
intended to be limiting in any way. The teachings of all references cited herein are 
incorporated by reference in their entirety. 
EXAMPLES 

Example 1. Isolation and Cloning of Reproduction-Specific Genes from Mice 
Isolation of Mouse Spermatogonia. 

Spermatogonia were isolated by the Staput method of sedimentation velocity 
at unit gravity (Bellve, A.R., Methods Enzymol. 225, 84-113 (1993)). Primitive type 
A spermatogonia were prepared from testes of 6-day-old CD-1 mice (Charles River 
Laboratories). Mature type A and type B spermatogonia were isolated from 8-day-old 
CD-1 mice. By microscopic examination, at least 85% of the cells in the 
resulting preparations were spermatogonia, with no more than 15% somatic cell 
contamination. 

cDNA Subtraction. 

Three independent subtraction experiments were carried out using cDNAs 
from primitive type A, type A, or type B spermatogonia as the tracer. In all cases, 
tracer and driver cDNAs were derived from oligo(dT)-selected RNAs. Germ-cell-depleted 
testes were from W^v/W^v animals. Prior to subtraction, tracer and driver 
cDNAs were digested to completion with Rsa I. In each of the three experiments, 
one round of subtraction was performed using the "PCR-select" 
protocol (Clontech) (Diatchenko, L. et al., Proc. Natl. Acad. Sci. USA 93, 6025-6030 
(1996)). To more thoroughly subtract ubiquitous cDNAs, four additional rounds of 
subtraction were performed using a modified procedure (Douglas Menke, Whitehead 
Institute, personal communication) as described in Lavery, D.J. et al., Proc. Natl. 
Acad. Sci. USA 94, 6831-6836 (1997). Between rounds of subtraction, enrichment 
of Dazl cDNA (germ-cell-specific) and disappearance of G3PDH 
cDNA (ubiquitous) were monitored. Three plasmid libraries (one for each of the 
three independent experiments) were prepared from the resulting pools of subtracted 
cDNA fragments. 800 randomly selected clones from each of the three libraries (one 
read only) were sequenced. Of the 2400 sequences generated, 165 were of poor 
quality or derived from the cloning vector, leaving 2235 sequences for further 
analysis. 

Sequence Analysis. 

Of the 2235 sequence fragments, 409 corresponded to 13 previously reported 
germ-cell-specific genes (142 to Mage, 11 to Ube1y, 2 to Usp9y, 44 to Rbmy, 10 to 
Tuba3/Tuba7, 2 to Stra8, 45 to Ott, 16 to Sycp2, 3 to Sycp1, 3 to Figla, 8 to Sycp3, 
21 to Ddx4, and 102 to Dazl). Each of the remaining 1826 sequence fragments 
was searched electronically for redundancies and identities to known genes. 98 
unique, novel sequence fragments were found that were each recovered at least 
twice. Each of these 98 sequences was tested for germ cell specificity by RT-PCR 
on 14 tissues. Of the 98 sequences, 45 were found to be expressed in spermatogonia 
and wild-type testis, but not in somatic tissues including W^v/W^v testis, indicating that 
they are germ cell specific. After full-length cDNA sequences were assembled, 
these 45 sequence fragments were found to derive from a total of 23 different genes. 
Of the original set of 2235 sequence fragments, 546 corresponded to these 23 novel 
genes (8 to Fthl17; 29 to Usp26; 38 to Tktl1; 66 to Tex11; 2 to Tex16; 132 to Taf2q; 
57 to Pramel3; 13 to Nxf2; 5 to Tex13; 4 to Pramel1; 3 to Tex17; 2 to Stk31; 6 to 
Rnh2; 29 to Tex12; 4 to Tex18; 2 to Tex14; 8 to Rnf17; 16 to Piwil2; 36 to Mov10l1; 
7 to Tex20; 71 to Tex15; 6 to Tex19; 2 to Tdrd1). 

cDNA Cloning. 

Full-length mouse cDNA sequences were composites derived from 
subtracted cDNA clones, 5' and 3' RACE products, and clones isolated from 
conventional cDNA libraries that were prepared from adult testes (Clontech, Palo 
Alto, CA; Stratagene, La Jolla, CA; and one library of our own construction). 
Orthologous human sequences were identified by searching GenBank using mouse 
cDNA sequences. Full-length human cDNA sequences were obtained by screening 
a cDNA library prepared from adult testes (Clontech). 

RH Mapping. 

Using PCR, genomic DNAs from the 93 cell lines of the mouse T31 
radiation hybrid panel (Research Genetics, Huntsville, AL) were tested for the 
presence of each gene (McCarthy, L.C. et al., Genome Res. 7, 1153-1161 (1997)). 
PCR conditions and primer sequences have been deposited at GenBank, where 
accession numbers are as follows: Figla, G65193; Magea5, G65194; Ddx4, G65195; 
Ott, G65196; Sycp2, G65197; Sycp3, G65198; Stra8, G65199; Tuba3, G65200; 
Tuba7, G65201; Fthl17, G65202; Mov10l1, G65203; Nxf2, G65204; Piwil2, G65205; 
Pramel1, G65206; Pramel3, G65331; Rnf17, G65207; Rnh2, G65208; Stk31, 
G65210; Taf2q, G65211; Tdrd1, G65212; Tex11, G65213; Tex12, G65214; Tex13, 
G65215; Tex14, G65216; Tex15, G65217; Tex16, G65218; Tex17, G65219; Tex18, 
G65220; Tex19, G65221; Tex20, G65222; Tktl1, G65223; and Usp26, G65224. 
Analysis of the results positioned the genes with respect to the radiation hybrid map 
of the mouse genome constructed at the Whitehead/MIT Center for Genome 
Research (Van Etten, W.J. et al., Nature Genet. 22, 384-387 (1999); 
www-genome.wi.mit.edu/mouse_rh/index.html). Chromosomal mapping data of human 
genes were retrieved from GenBank and confirmed by RH mapping using the 
GeneBridge 4 panel (Research Genetics). 
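The idea behind placing a gene on a radiation hybrid map can be illustrated with a toy calculation. A gene is scored as present or absent by PCR in each hybrid cell line, and its retention pattern is compared with those of markers already on the map: closely linked loci tend to be retained together. The sketch below is an assumption-laden illustration, not the analysis actually used for the T31 panel. The function name and the three example vectors are hypothetical, and real RH mapping software uses statistical linkage models rather than a raw concordance fraction.

```python
def concordance(marker_a, marker_b):
    """Fraction of hybrid lines in which two markers share the same PCR call.

    Each argument is a sequence with one entry per cell line:
    1 = PCR product present, 0 = absent, None = not scored.
    """
    if len(marker_a) != len(marker_b):
        raise ValueError("both markers must be typed on the same panel")
    scored = [(a, b) for a, b in zip(marker_a, marker_b)
              if a is not None and b is not None]
    if not scored:
        raise ValueError("no cell line was scored for both markers")
    same = sum(1 for a, b in scored if a == b)
    return same / len(scored)


# Hypothetical retention patterns across a 93-line panel (toy data only).
N_LINES = 93
gene = [1] * 30 + [0] * 63

# A nearby framework marker: identical except for three discordant lines.
linked_marker = list(gene)
for index in (0, 40, 92):
    linked_marker[index] = 1 - linked_marker[index]

# An unrelated marker with an independent-looking retention pattern.
unlinked_marker = [1 if i % 3 == 0 else 0 for i in range(N_LINES)]

if __name__ == "__main__":
    print("linked:", concordance(gene, linked_marker))
    print("unlinked:", concordance(gene, unlinked_marker))
```

In this toy data the gene agrees with the nearby marker in 90 of 93 lines and with the unrelated marker in far fewer, which is the kind of signal a mapping program turns into a map position.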

Expression Analysis. 

RT-PCR conditions and primer sequences have been deposited at GenBank, 
where accession numbers for mouse genes are as follows: Gapd, G65758; Fshr, 
G65759; Dazl, G65760; Rbmy, G65761; Fthl17, G65778; Mov10l1, G65779; Nxf2, 
G65780; Piwil2, G65781; Pramel1, G65762; Pramel3, G65782; Rnf17, G65763; 
Rnh2, G65783; Stk31, G65784; Taf2q, G65785; Tdrd1, G65786; Tex11, G65787; 
Tex12, G65788; Tex13, G65789; Tex14, G65790; Tex15, G65791; Tex16, G65792; 
Tex17, G65793; Tex18, G65794; Tex19, G65795; Tex20, G65796; Tktl1, G65797; 
Usp26, G65798. Accession numbers for human genes are as follows: FTH1, 
G65764; FTHL17, G65765; MOV10L1, G65766; NXF2, G65767; RNF17, G65799; 
STK31, G65768; TAF2Q, G65769; TDRD1, G65770; TEX11, G65771; TEX12, 
G65772; TEX13A, G65773; TEX13B, G65774; TEX14, G65775; TEX15, G65776; 
USP26, G65777. 

Example 2. Isolation and Cloning of Reproduction-Specific Genes 

380 infertile men (217 with azoospermia and 163 with oligospermia) and 93 fertile 
males were screened for mutations in two X-linked genes (TAF2Q and TEX11). 
The Klondike PCR-based subtraction protocol (Diatchenko, L. et al., Methods 
Enzymol. 303, 349-80 (1999); Diatchenko, L. et al., Proc. Natl. Acad. Sci. USA 93, 
6025-30 (1996)) and a modified subtraction protocol (modified by Doug Menke, 
personal communication) (Lavery, D.J. et al., Proc. Natl. Acad. Sci. USA 94, 6831-6 
(1997); Yang M. et al., Anal. Biochem. 237(1):109-14 (1996); Ausubel, F.M. et al., 
Current Protocols in Molecular Biology (1997)) were used to generate a subtraction 
cDNA library for each type of spermatogonia. In detail, cDNAs synthesized from 
mRNAs of infertile males and fertile males' spermatogonia were subtracted against 
a mixture of cDNAs, present in great excess, derived from mRNAs of 11 different 
somatic tissues (heart, brain, lung, liver, skeletal muscle, kidney, spleen, stomach, 
thymus, skin and W^v/W^v testis). W^v/W^v testes are essentially devoid of germ cells 
(Geissler, E.N. et al., Cell 55, 185-192 (1988)). After subtraction, germ cell-specific 
genes are expected to be enriched and ubiquitous genes to be removed to a 
certain degree. The subtractions were successful, as demonstrated by the 
enrichment of Dazl transcript (germ cell-specific) (Reijo, R., et al., Genomics 35, 
346-52 (1996)) and the disappearance of G3PDH transcript (ubiquitous, present in 
all the tissues). The subtracted cDNAs were directly cloned into a plasmid vector to 
make a subtracted cDNA library. A library was constructed from infertile men and 
fertile men. Clones randomly picked from each library were sequenced, using an ABI 
370 sequencer (ABI, Foster City, CA). A total of 2300 sequences was obtained. A 
combination of different methods was used to obtain full-length cDNA sequences: 
subtracted DNA sequencing, cDNA library screening of Stratagene and Clontech 
testis cDNA libraries (Stratagene, La Jolla, CA and Clontech, Palo Alto, CA), direct 
RT-PCR of testis cDNAs and sequencing, 5' RACE (rapid amplification of cDNA 
ends) (Ausubel, F.M. et al., Current Protocols in Molecular Biology (1997)), 3' 
RACE, and direct screening and amplification of cDNA library subpools by PCR 
using one gene-specific primer and one vector-specific primer. 

To determine the germ cell specificity, an RT-PCR (reverse transcription 
polymerase chain reaction) assay of each clone was performed on a panel of thirteen 
different tissues (heart, brain, lung, liver, skeletal muscle, kidney, spleen, stomach, 
thymus, skin and W^v/W^v testis) (Ausubel, F.M. et al., Current Protocols in Molecular 
Biology (1997)). These novel X-linked genes are designated FTH1, FTHL17, 
USP26, TEX11, TAF2Q, NXF2, TEX13A, TEX13B, STK31, TEX12, TEX14, 
RNF17, MOV10L1, TEX15 and TDRD1. 
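The germ cell specificity call can be written as a simple rule. The sketch below assumes the criterion stated in Example 1 (expression detected in spermatogonia and wild-type testis, and not in somatic tissues or germ-cell-depleted testis) and applies it to the tissues named in the panel above. The function name, the tissue labels and the two example clones are hypothetical, and treating an untested tissue as "not detected" is an assumption made only for brevity.

```python
# Tissues named in the RT-PCR panel; germ-cell-depleted testis is treated
# as somatic because it is essentially devoid of germ cells.
SOMATIC_PANEL = (
    "heart", "brain", "lung", "liver", "skeletal muscle", "kidney",
    "spleen", "stomach", "thymus", "skin", "Wv/Wv testis",
)
GERM_CELL_SAMPLES = ("spermatogonia", "wild-type testis")


def is_germ_cell_specific(expression):
    """Return True if a clone is detected only in germ-cell-containing samples.

    `expression` maps a tissue name to True (RT-PCR product detected) or
    False. Tissues missing from the mapping are treated as not detected
    (an assumption of this sketch).
    """
    in_germ_cells = all(expression.get(t, False) for t in GERM_CELL_SAMPLES)
    in_somatic = any(expression.get(t, False) for t in SOMATIC_PANEL)
    return in_germ_cells and not in_somatic


# Two hypothetical clones, for illustration only.
clone_a = {"spermatogonia": True, "wild-type testis": True}
clone_b = {"spermatogonia": True, "wild-type testis": True, "liver": True}

if __name__ == "__main__":
    print("clone_a germ cell specific:", is_germ_cell_specific(clone_a))
    print("clone_b germ cell specific:", is_germ_cell_specific(clone_b))
```

Under this rule a ubiquitous transcript such as G3PDH would be rejected, because it is detected in every somatic tissue.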

The mutations in TEX11 and TAF2Q were analyzed further. The structure 
of the gene was assessed, TEX11 BACs and sequence were screened, and primers were 
chosen spanning each exon. Infertile men were screened and the two genes 
sequenced. Polymorphism and causality were distinguished by looking at normal 
male controls, the nature of the variants, study of maternal relatives (linkage), 
conservation between mouse and human, and splicing in vivo. There were 33 mutations 
found in TEX11, 12 in exons (4 silent) and 21 in introns. Of these, 21 were found only 
in infertile males (380 males), 1 was found only in normal (fertile) males (93 males) 
and 11 were polymorphisms (found in both infertile and normal males). The variants 
of TEX11 are depicted in Figure 108. 

There were 15 variants found in TAF2Q, 7 in exons and 8 in introns. Of these, 5 
were polymorphisms (found in both infertile and normal males), 9 were found only 
in infertile males, and 1 was found only in normal fertile males. Figure 112 depicts 
the variants in TAF2Q. 
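The variant counts reported for the two genes can be checked for internal consistency. The sketch below uses only the numbers given in this Example; the data structure and function names are hypothetical and serve only to make the bookkeeping explicit.

```python
# Variant counts transcribed from the text of Example 2.
VARIANTS = {
    "TEX11": {"total": 33, "exonic": 12, "intronic": 21,
              "infertile_only": 21, "fertile_only": 1, "shared": 11},
    "TAF2Q": {"total": 15, "exonic": 7, "intronic": 8,
              "infertile_only": 9, "fertile_only": 1, "shared": 5},
}

# Cohort sizes transcribed from the text.
COHORT = {"azoospermia": 217, "oligospermia": 163,
          "infertile_total": 380, "fertile_total": 93}


def is_consistent(counts):
    """Check that both breakdowns of a gene's variants sum to its total."""
    by_location = counts["exonic"] + counts["intronic"]
    by_group = (counts["infertile_only"] + counts["fertile_only"]
                + counts["shared"])
    return by_location == counts["total"] and by_group == counts["total"]


if __name__ == "__main__":
    for gene, counts in VARIANTS.items():
        print(gene, "consistent:", is_consistent(counts))
    cohort_ok = (COHORT["azoospermia"] + COHORT["oligospermia"]
                 == COHORT["infertile_total"])
    print("cohort consistent:", cohort_ok)
```

Both breakdowns (by location and by the group in which the variant was found) add up to the stated totals for TEX11 and TAF2Q, and the two infertile subgroups add up to the 380 men screened.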

CLAIMS 

What is claimed is: 

1. An isolated reproduction-specific nucleic acid molecule comprising a nucleic 
acid molecule having a nucleotide sequence selected from the group 
consisting of: 

(a) SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 
33, 35, 37, 39, 41, 43, 45, 47, 49, 50, 52, 54, 55, 56, 58, 59, 61, 62, 
64, 66, 67, 69, 71, 73, 74, 75, 77, 79, 81, 83, 86, 87 and 89; and 

(b) the complements of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 
23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 50, 52, 54, 55, 
56, 58, 59, 61, 62, 64, 66, 67, 69, 71, 73, 74, 75, 77, 79, 81, 83, 86, 
87 and 89. 

2. A nucleic acid construct comprising an isolated reproduction-specific nucleic 
acid molecule according to Claim 1 operably linked to at least one regulatory 
sequence. 

3. A host cell comprising a nucleic acid construct according to Claim 2. 

4. An isolated reproduction-specific nucleic acid molecule comprising a portion 
of a nucleic acid molecule having a nucleotide sequence selected from the 
group consisting of: 

(a) SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 
33, 35, 37, 39, 41, 43, 45, 47, 49, 50, 52, 54, 55, 56, 58, 59, 61, 62, 
64, 66, 67, 69, 71, 73, 74, 75, 77, 79, 81, 83, 86, 87 and 89; and 

(b) the complements of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 
23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 50, 52, 54, 55, 
56, 58, 59, 61, 62, 64, 66, 67, 69, 71, 73, 74, 75, 77, 79, 81, 83, 86, 
87 and 89, 

wherein said portion is at least 14 contiguous nucleotides in length. 

5. A nucleic acid construct comprising an isolated reproduction-specific nucleic 
acid molecule according to Claim 4 operably linked to at least one regulatory 
sequence. 

6. A host cell comprising a nucleic acid construct according to Claim 5. 

7. An isolated reproduction-specific nucleic acid molecule comprising a nucleic 
acid molecule which hybridizes under high stringency hybridization 
conditions to a nucleic acid molecule having a nucleotide sequence selected 
from the group consisting of: 

(a) SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 
33, 35, 37, 39, 41, 43, 45, 47, 49, 50, 52, 54, 55, 56, 58, 59, 61, 62, 
64, 66, 67, 69, 71, 73, 74, 75, 77, 79, 81, 83, 86, 87 and 89; and 

(b) the complements of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 
23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 50, 52, 54, 55, 
56, 58, 59, 61, 62, 64, 66, 67, 69, 71, 73, 74, 75, 77, 79, 81, 83, 86, 
87 and 89. 

8. A nucleic acid construct comprising an isolated reproduction-specific nucleic 
acid molecule according to Claim 7 operably linked to at least one regulatory 
sequence. 

9. A host cell comprising a nucleic acid construct according to Claim 8. 

10. An isolated reproduction-specific nucleic acid molecule comprising a nucleic 
acid molecule having a nucleotide sequence which is at least 70% identical 
to a nucleotide sequence selected from the group consisting of: 

(a) SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 
33, 35, 37, 39, 41, 43, 45, 47, 49, 50, 52, 54, 55, 56, 58, 59, 61, 62, 
64, 66, 67, 69, 71, 73, 74, 75, 77, 79, 81, 83, 86, 87 and 89; and 

(b) the complements of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 
23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 50, 52, 54, 55, 
56, 58, 59, 61, 62, 64, 66, 67, 69, 71, 73, 74, 75, 77, 79, 81, 83, 86, 
87 and 89. 

11. A nucleic acid construct comprising an isolated reproduction-specific nucleic 
acid molecule according to Claim 10 operably linked to at least one 
regulatory sequence. 

12. A host cell comprising a nucleic acid construct according to Claim 11. 

13. An isolated reproduction-specific nucleic acid molecule which encodes a 
protein having an amino acid sequence selected from the group consisting of 
SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 
38, 40, 42, 44, 46, 48, 51, 53, 57, 60, 63, 65, 68, 70, 72, 76, 78, 80, 82, 84, 
85, 88, and 90. 

14. A nucleic acid construct comprising an isolated reproduction-specific nucleic 
acid molecule according to Claim 13 operably linked to at least one 
regulatory sequence. 

15. A host cell comprising a nucleic acid construct according to Claim 14. 

16. An isolated variant reproduction-specific nucleic acid molecule comprising a 
nucleic acid molecule having the nucleic acid sequence of SEQ ID NO: 89 
having one or more alterations selected from the group consisting of A320G, 
T325A, C381T, G400A, A491G, G1282A, C1449A, T2219C, A2250T, 
T2295C and T2472C. 

17. A nucleic acid construct comprising an isolated reproduction-specific nucleic 
acid molecule according to Claim 16 operably linked to at least one 
regulatory sequence. 

18. A host cell comprising a nucleic acid construct according to Claim 17. 

19. An isolated variant reproduction-specific nucleic acid molecule comprising a 
nucleic acid molecule having the nucleic acid sequence of SEQ ID NO: 50 
having one or more alterations selected from the group consisting of the 
alterations shown in Figure 112. 

20. A nucleic acid construct comprising an isolated reproduction-specific nucleic 
acid molecule according to Claim 19 operably linked to at least one 
regulatory sequence. 

21. A host cell comprising a nucleic acid construct according to Claim 20. 

22. An isolated protein comprising an amino acid sequence selected from the 
group consisting of SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 
26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 51, 53, 57, 60, 63, 65, 68, 70, 
72, 76, 78, 80, 82, 84, 85, 88, and 90. 

23. An isolated protein comprising a portion of an amino acid sequence selected 
from the group consisting of SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 
22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 51, 53, 57, 60, 63, 65, 
68, 70, 72, 76, 78, 80, 82, 84, 85, 88, and 90, wherein said portion is at least 
7 contiguous amino acids. 

24. An isolated protein encoded by a nucleic acid molecule according to Claim 
1. 

25. An isolated protein encoded by a nucleic acid molecule according to Claim 
4. 

26. An isolated protein encoded by a nucleic acid molecule according to Claim 
7. 

27. An isolated protein encoded by a nucleic acid molecule according to Claim 
10. 

28. An isolated protein encoded by a nucleic acid molecule according to Claim 
13. 

29. An isolated protein encoded by a nucleic acid molecule according to Claim 
16. 

30. An isolated protein encoded by a nucleic acid molecule according to Claim 

31. An antibody which specifically binds a protein according to Claim 22. 

32. An antibody which specifically binds a protein according to Claim 23. 

33. An antibody which specifically binds a protein according to Claim 24. 

34. An antibody which specifically binds a protein according to Claim 25. 

35. An antibody which specifically binds a protein according to Claim 26. 

36. An antibody which specifically binds a protein according to Claim 27. 

37. An antibody which specifically binds a protein according to Claim 28. 
38. An antibody which specifically binds a protein according to Claim 29. 

39. An antibody which specifically binds a protein according to Claim 30. 

40. An isolated protein comprising the amino acid sequence of SEQ ID NO: 90 
having one or more alterations selected from the group consisting of W109R, 
V134I, G164R, N483K and V740A. 

41. An antibody which specifically binds a protein according to Claim 40. 

42. A method of diagnosing infertility associated with alteration of a gene having 
a nucleotide sequence selected from the group consisting of SEQ ID NOS: 1, 
3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 
47, 49, 50, 52, 54, 55, 56, 58, 59, 61, 62, 64, 66, 67, 69, 71, 73, 74, 75, 77, 
79, 81, 83, 86, 87 and 89, and whose alteration is associated with infertility, 
comprising the steps of: 

a) obtaining a DNA sample to be assessed; 

b) processing the DNA sample such that the DNA is available for 
hybridization; 

c) combining the DNA of step (b) with nucleotide sequences 
complementary to the altered nucleotide sequence of said gene, 
whose alteration is associated with infertility, under conditions 
appropriate for hybridization of the probes with complementary 
nucleotide sequences in the DNA sample, thereby producing a 
combination; and 

d) detecting hybridization in the combination, 

wherein presence of hybridization in the combination is indicative of 
infertility associated with an alteration of said gene. 

43. A method according to Claim 42, wherein infertility is a result of reduced 
sperm count, reduced sperm motility, malformed sperm, or combinations 
thereof. 

44. A method of diagnosing infertility associated with alteration of a gene having 
a nucleotide sequence selected from the group consisting of SEQ ID NOS: 1, 
3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 
47, 49, 50, 52, 54, 55, 56, 58, 59, 61, 62, 64, 66, 67, 69, 71, 73, 74, 75, 77, 
79, 81, 83, 86, 87 and 89, and whose alteration is associated with infertility, 
comprising the steps of: 

a) obtaining a DNA sample to be assessed; 

b) processing the DNA sample such that the DNA is available for 
hybridization; 

c) combining the DNA of step (b) with nucleotide sequences 
complementary to the nucleotide sequence of said gene, whose 
alteration is associated with infertility, under conditions appropriate 
for hybridization of the probes with complementary nucleotide 
sequences in the DNA sample, thereby producing a combination; and 

d) detecting hybridization in the combination, 

wherein absence of hybridization in the combination is indicative of 
infertility associated with an alteration of said gene. 

45. A method according to Claim 42, wherein infertility is a result of reduced 
sperm count, reduced sperm motility, malformed sperm, or combinations 
thereof. 

Figure 1 

SEQ ID NO.: 1 Spg1 cDNA sequence 

acactcaacctccaagcgccgaaaaagaagtgaagagactgctctactcagacgctgaa 
gctgccagtgtccgctgggaagtcgtzgatgacgatgacgctaaggaaatagaaagtca 
agagtcc\ T GCCAACC\CTCGAGGGGCTCGAGGCCC\CCGACTCCCGGCAGGGCCAAGT 
CTCAGA^GACCCCCCGCCAC-GGCACTGCTCGCTGCCAGACCCTCGAGAGTGCCATGCGA 
TCCATGflGTGTAAGGCTTQAATGTCASGACGTTGAGG^CAGTT^TATTGCGTCTGCC 
TCCGGA2 C lAGCTT^ TGCTGTCAC^J^ATTATACATTCTAGAAATGCTGCTTGGAAGG 
ATAA? CTA? .AAATTGACTTTTCTCCTGATGGCCACCATGCGGTTGTTCAAGTAGACAAC 
GTCTC\CTGCCTGCTAAACTGGTTAATCTGCCTTGTGTTATCGGAAGCCTGAAAACTAT 
TGAC^GAAAGACATTTTATAAGACCGCGGATGTTTCTCAGATGCTTGTATGCAGTCCTG 
AAGGTGAGCCTCATTCTCCTCCTGAAGAACCAGTTGTCTCTACTGGTCCTACTGTAATT 
GGAATTAGTGAAGGGAAGGCAGAGAGAAAAAAATATAACTGGAAGCATGGCATTACTCC 
ACCACTTA^GAATGTCAGAAAGAAAAGGTTCCGGAAAiVCVACAAAASAGCTCCCAGATG 
TGAAACAAGTGGATGAAATCAACTTTAGTGAGTACACTC^TCTCCAAGTGTGGAAAAA 
GAAGTGAAGAGACTGCTCTACTCAGATGCTGAAGCTGTCAGTGTCCGCTGGGAAGTCGT 
TGATGATGATGATGCTAAGGAAATAGAAAGTCAAGGGTCCATGCCAACCACTCCAGGAA 
TCTCACAGATGGGTGGTGCTAGTTTATCAGACTATGATGTGTTTCGGGAGATGATGGGT 
GATTCTGGCAGCAACAGTAATGATGTGGAAGAGAAGAGTAATGAAGGTGACGACGATGA 
TGATGAAGATGAAGATGATGAAGACTATGGAAATGAAAAGGAGGAGGAAGAGACAGACA 



ATGGA£ ^GAGAAGTGTGAACAGATTTATCACCTCCAGeAAi-AKUiw^.^- 
GAAGGAGTAAggaaagccccagcctggcacaccaagtggattcaccctccagatgaagc 
ctgggtggaaccacatacttctgeccctctactttatcttaaacaacttttatttgttg 
agcatttttcactaaagtaaatttaaggatcacatttatataggagacatatatagagg 

-gagCtaeataaatgca-taggt't-tagagacccac 

ttgagattaaatgtg'tgtatttctttatctcttactctacggtaccacgagatcattca 
gccgtccttgtcaaaggtt-aggctagaaagatacacagctgtttcacataagcccac. 
tfctcaaacctggttttaagtatttcctacaattttgaaaatgatcaaatcgtcagctgg 
caatcccagcaatccaaaaggcagaagcagaaggtcaaattgaaggccagcczgggctg 

ta ta taacac cczgtc ccaaaataaaa taaactgaaaac caaaaaaaaaa 

Figure 2 

SEQ ID NO.: 2 Spg1 encoded protein sequence 

MP' T ^GARGP?TPGRAXSQK?PRQGTARCQTLESAMRSMSVRLECKDVEEQ^ ^ 

QA^yRKJIHSPJJAAWKDKLKIDFSPDGHF^^^^^^ 

SrFYKT^SQMLVCSPEEEPKSPPEEPWSTGP^ 

KNVP^PJCTTKKLPDVKQVDEINFSEY^ 




DEDDEDYGNEKSEEETDNSEEEL£KELQAis_r Mtf bLttiAuyuiaoi !?«Brtr rrcrrB 
RLQMIYIC<AQRQKELL?JC\/ENLTLKREFQNVLGKLNI2eKE 



Figure 3a 

SEQ ID NO.: 3 Spg2 cDNA sequence 

GAGACGGAGACC^.GTCrc-AGACGCGAGAC 

AGACGCC^CCCTG^^AACAACTTTCTCCTGGATTTGCAGCTTCCTCAACGTCCCTGCA 
C^TTCAGGCTGGAGCCAGACATTTAAAAA^TGGACCGCA 
GAC TTCAGAGAATCTGTTAAAA.CC CTGATC ATAACTGGTAATTCATGG AG AC T ACAAGA 

AATGATTGACAGATTCTTCACAAACATATCAAATTTCAACAGAGAGTCTCTGACTGAAA 

TACAGAATATTCA.GATTGAAaAAATTGCAGTGAACCTGTGGAACTGGGCAGTTACTAAG 

AGAGTAGAACTGTCTGTGAGGAAAAACCAGGCAGCTAAACTGTGTTATATTGCTTGCAA 

GCTGGTATAT ATGC ATGGAATC TCAGTC TC TTC AG AAGAAG C TATTCAAAG ACAG ATTT 

TGATGAATATAAJ\AACAGGAAAAGAGTGGTTGTATACTGGAAATGCTC\GATTGCTGAT 

GAATTTTTTCAAGCTGCCL^TGACTGATCTGGAGAGATTATATGTCAG 

CTGCTACAC C GAGGC CAAC GTGTCTGTGTATAAGATGATTGTTGAG AAAGGC ATCTT C C 

ATGTGCTTTCTTACCAAGCTGAGTCAGCTGTTCCTCAAGGGGA 

C TG TGCGTC TT ACGTTGCAAAG AT ATGC TG ATGAGAC TC C C T AACATGACAAAAT ATC T 
TCATGTACTCTGTTACA^CCTTGGCATAGAAGCAAGCAAGCGGAATAAATACAAAGAGA 
GTTCATTCTGGCTTGGCCAAAGCT^ 

CCACAAATGCTG^TAAAACGCTGCGGTTACTAGCCACTATTTATTTC^TTGTGGTGG' 
CGAAGCATATTATACCAAGGCCTTCATTGCTATACTCATTGCAAACAAGGAACA^ 
ATC CAGCTGGGCTTTTCTTAAAGATGAGGATC CTCATGAAAGGC AACTC ATGT AATGAA 
GAACTC CTTGAAGC TGC TAAGGAAATACTATATCTTGCTATGCC TTTGG AATT C T ATC T 
GAGCATTATTCAATTCCTGATAGATAATAAAAGAGAGTCTGTTGGGTTTCGCTTTCTGA. 
GAATCATCTCTGACAATTTTAAGTCGCCAGAAGATAGGAA 

ATTGACACGCTTTTACAAAAGGATCAAGACATGATTGCTGAAGAGAAGATTAAAGACGT 
CCTTAAAGGTTACCAAACAAGAAGTCGACTGTGAAGAGATTTGGTAAATTGGTTACACA 
AC ATTCTGTGGGGAAAGGCTTC CAGAAG TGTTAAGGTC CAAAAATATGC TG ATGCC C TA 
CACTGGTACAGTTATTCTCTGAAG TTG TATGAG TATGATAAAGC AGATC TGGATTTGAT 
GAAGCTGAAGAGGAACATGGTTTCCTGTTACTTATCTTTGAAACAAC 
AAGAGGCC\TAGO\GAAGTTGAGCAAAAGGA 

ATATTCAAGATCGCAATCATCGAGGGTGATGCTTTCAGAGCTTTACL^GGTCGTCAGTGC 
TTTAAAGAAATCATTAATGGATGGAGAATCAGAAGATCGTGGACTAATTGAAGCTGGAG 
TTTCAACTC TC ACAATC CTAAGTTTATC TATAGATTCT CAA 
TTTGTGGCAGAAAGAGCTTTCGAATATTTATGTC 

ACTTGGAGGTTTAAAGTGTCTCATGO^ 

AATCTCAATATAAAAA(iArJ\GAAATGGGTAGACTTTGGAACTACTTGAA^ 

CTGAAATTTTCTGAJITATTTTAATGAAC-CTCCCTCAACTTTC-GATTA 

TGCCAATTGGTTCAGGAAAATAGCTTGGAACTTAGCTGTGCAATCTGAGAAGGATCTAG 

AGGCA^TGAAAAACTTTCTCATGGTTTCTTATAAGCTCTCCCTTTTTTGTC 

CAAGGACTACTGATTGCACAGAAAACGTGTTTACTTGTAGCAGCTGCAGTTGATCTGGA 

TAGAGGAAG AAAAGC TCCAAC AATTTG TG AGC AGAACATGTTACTAAG AAC AG CAC TTG 

AGCAGATAAAGAAATGCAAAAAAGTTTGGAATC TCCTGAAAAAAACAGGGGAC TTCTCA 

GGTCATGACTGTGGGGTATTGCTTCTGCTCTATGAATTTGAAGTTAAAACCAAAACGAA 

TGATCCATCACTGAGCAGATTTGTGGATTCAGTTTGGAAGATGCCTGATTTAGAATGCA 

GAAC ACTTCAAACVATGGC ATTACTAGCT ATGGATAAACCTCCATAC TATC CTAC T ATT 

GCACATAAGGCCATGAAAAAACTTTTATTGATGTACAGA^ 

TTTAAAATACAGCGTATGCATGCACAACTTGATTAAACTTCTGGTGGCAGATGAAGTAT 

GGAATATATCGCTGTATCCCCTAAAAGAAGTTCA-GAGCC^TTTTAAAAATACTC 

ATCA.TTCGCCAAAACGAAGGATACCCAGAAGAGGAGATTGTATCGCTAATGATCAAGTC 

TTGGAATATTGGAATACTCATGTCTAGCAAGAACAAGTATATATCTGCAGAAAGGTGGG 

CTGCAATGGCATTGGATTTCCTTGGCCACCTTAGCACCCTCAAAACAAGCTATGAAGCA 

AAGGTCAJYTCTTCTCTATGCCAACCTCATGGAAATATTAGATA^ 

ATCTACAGAGATCACTGAACAATTAAGAGCACTTATTGTTCCTCCGGAGGATCAAGGTT 
CAGTTTCCAGC ACC AACGTGGC AGC TC AAAACCATCTGTAATTC CAGTTCC AGGGAATC 
Figure 3c 

TGATACTCCCCTCTGGTCACTGTGGGTACATGTCATAftAAGTAAAGAGTGATTTTACTT 
TTC\GTGGAGAGTCATAATGATTGAAGGAGGTTGTAGAAAGCAGACTAGCTCACAGCAA 
• CTTGGAAGGTGTAT ATTCATGAGCCGCTGACTATTTCAGTGG TGG^TCA 

Figure 4 

SEQ ID NO.: 4 Spg2 encoded protein sequence 

MDRITDFYr LDFRESVKTLIITGNSWRLQEMIDRFr TNI SNFNRESIiTE IQNIQ I2E IA 

VNL WNWAVTKRVEL SVRKNQ AAKLCY I ACKLVYMHG I SVS SEEAIQRQ ILMNIXTGKEW 

LYT<^QIADEFFQAA3yriT)LERLYVRIiMQSCY 

VAQGDFKKASLCVLRCIODMIiMRLPNMTKYti^ 

IGKMDRRSVEPQMUAKTL 

ILMKGNSCNEELLEAAKEILYLAMPLEFYL^ 

EDRiOlILLFYIDTLLQKDQDMIAEEK 

VKVQKYAD ALHWYS YSLKLYEYDKADLDLI KLKRNMVS C YL S LKQUDKAKEAI AEVE QK 
D PTHVFTRYYIFKI AXMEGDAFRALQVV S ALKKS LMDGES EDRGLIEAGVSTLTILS L S 
IDFALENGQQFVAERALEYLCQLSKDPKEVLGGLTC 
RLWNYLOTALLKFSEYFNEAPST^ 

YKL SLFCP LDQGLL IAQKTCLLVAAAVDLDRGRKAPT ICEQNMLLRTALEQ IKKCK2CVW 

NLLKKTGDFSGDIXGVLLLLYSFEVK^ 

MDKPAYYPTIAHXAMKKLLI^^^ 

VQSKFKNTLSIIRQNEGYPEEEIVWLMIKSWN^ 

L STLKT S YEAKVNL LYANLME I LDKICTDLRS TEMTEQLPAL I VP PEDQG SV£ STNVAAQ 
NHL* 

Figure 5 a 

SEQ ID NO.: 5 Spg3 cDNA sequence 

cgctggcgtttgaagagagcacgaggaacggctcagtttcctggagactagacagagcg 
agcgagcagccgttcagtgtagtcgttggtgcaATGTGGTCCTCTCCAAAAGAGAATCT 
GCAAGGCAGGTCTC^CATGTTTGTTCAGAAGAACATTAAT^ 

GATATGGTTTACCATACAAGAGATCTGAGAGATTCTATCATTCLAGAATACAAAATGAAT 
TATAACCATGGTTTTCAAGGAAGAAAGAGAGGTGTGAATTATATCTGGAGTCAATTTC 

CAGA^GAACAATCATTTTGATC^TTATGGTGCTCCATATGCCATGG 
GGAGAGAAAGATGCAGTTATGATGACCAATATTTTCTTA^TGTGTGGGATGATAGTAAA 

ACTGAGGAGGGCGAAACAGATCTGGATGCTGAAAATGAAACTC^ 
GGTCACAATTCCCAGTGGAJ^GAAAGTATCAGAAG 

ACTTCTGTAGTGAGCCCTTCATCCCTGTTGATTTCCACTATGACAAAACCCAGGCCCGG 
TTCTTTGTTCAGAATGCTAAGACTGCCTCTGC^ 

TGATGAAACAAGCAGAAAAATAGCGATCTTTGTC^GTCCTTCTGTTGTGCCCTATTCTC 
TGCAAAACAAGTTTACATCAGAACAAATGGAGTACAT^^ 

TATGACGCCTCCCAGAAAGCTCTGGACCTCGAAAAGTTCCGATTTGACCAGGACTTAAT 
GGACAAGGATATTGACATGATGCTGA^TCGAAGAAGCTGCATGGTTGCCACACTAC^GA 
TCATTC>J ^GTGATATCCCTGAACTC-TTGTCCTTGAACTTGAC 

CAGC TGGATGGGC TGTCAGAC ATGACAGAGAAGGCC CCTCAC GTTAAG ATC CTGAAC C T 
CTCCCGAJ^ATAAACTGAAGTCATTC^CGGAATTGGAG^ 

AAGAGCTGTGGCTGGAJVGGGAACCCCTTCTGCAACTGCTTCTTGGATO.TTTTGAGTA^ 
ATAAGTACTATTCATGACCTATTCCCCAAGCTGTTGCGTCTGGATGGCGAGGACATAAT 
AGTAC C AAAAAG AAATC TTC^G AATGGCAAGGGC CT AATAGT ACCGACA^GAAATCTTC 
AGAATGGCAAGGACTTAATAGTACCGACAGGAAATC^ 

GTACCGAC^GGAAATCCTCAGGATGGCAAGGACCTAATAGTACCAAC^GGAAJ^TCCTCA 
GGATGGCAAGGACCTAAT AGTAC CAACAAAkATGGACATCGAGGTC C CGC AAC C ATGCA 
Figure 5b 

AGGAAAGCTGT AAC AC ATC TGAAGT C AT 

TACTACTTGTTTTATGACAJ^TGGAGATCGACTTCGTCTTCTCGATGCTTACCATGACCA. 
GGCCTGCTTCTCCTTGTCAGTTCCTTTCGA.TGTCAGTGACCCAAACCTGAACAA.CTTGG 
AAGAGTATTTCLWATAC^^ 

CAGTTGCTGAAGCACACAJ\AACATGACATCGTGAACTCTCT 

TCAGCATGATCTTTGCTCTTTCTTGGTGGACTTGTTTCTCCACACGGAAJ^TGATGCTCT 
GTTTTTCTGTG AATGGAC TA/TTC ATGGAAGTGGAAGG AAAATGTC GAG GC TGCATC C GT 
GCCTTCACGAC^ATCTTCATTGCTATCCCTTGCAGCGATTCAAGAATTTGCATCATGAA 
CGATGAGCTGATTGTGAGGAATGCCAGTC CCLAAAGAGATACAAAAGGC CTTCACCTCAT 
TGC CTGCACC C GAC AC GTCATTC AAGC C TTTGCTCTC TGAAG AACA.GC AGGAAATGGTG 
AAGTCTTTCTCTGTGCAGTCTGGAATGAAJVCTTGACTGGTCTCAGAAGTGCCTTCAAGA 
TAACGAGTGGG AC T AC ACCAAAGC TGGTGAGGC CTTC AC TGC TC TC C AGAATGAGGGCA 
AGATCCCAAAGGAATTCTTCAAATAAaaggatactaaagatgtcctgtggatagagtar 
tcctcctcatccacatracttcccrttataacgcccttcccacacctgggaatagagag 
aggcctttctgaccaagaagcaaaagttaacacgtaggccaagtaatataaccccczct 
ccccacattggcaatccttctgtctccctccaaagctgtttgtgatttcataataaaga 

grttctttacctaaaaaaaaa 

Figure 6 

SEQ ID NO.: 6 Spg3 encoded protein sequence

MWS S PKENLQGRS SMFVQKNINSETYXQRYGL ? YXRSERFYHS EYKMNYNHGF QGRKRG 
VNYIWSQFDRKNNHFDHYGAPYAMGMKRRRERCS 

NETEEKWYKVT I P SGRKYEKTWLMRS IQNF C S E P F I P VDFHYDKTQ AilFFVQNAKT AS A 

LKDVSYRICDETSRKIAIFVSPSWPYSVQN^ 

KFRFDQDLMDKDIDMMLITRRSCM^ 

APEVKILNLSjOIEGjKSFTELEKVKEL 

LRLDGEDITVPXRNLQNGKGLIVPTRNLQN^ 

LIYPTGNPQDGKDLFvPTXMDIEV^ 

RLLDAYHDQAXIFSLSVPFDVSDPNLNI^ 

-NSLSLLPKTQHBLCS^ " 

SDSRIC IMNDEL IVRft AS PKEIQKAFTS IjPAPDTSFKPLL seeqqemvksf svqsgmkl 
DWSQKCLQDNFaTDYTKAGEAFTALQNEGKI PKEFFK 

Figure 7a

SEQ ID NO.: 7 Spg5 cDNA sequence 

ATGACATACTTTTTTATTTATGTTTCCACAGAAAGAGCATGTTCTCTGAATAACTGTAC 

AACCTGTGGATCCATTTGTTCAGGAAAATTGTCCATGTAAAGCACTAAATTaAGA 
GGTCCTTTCAGC TC AG AT AC TTC TAGTTC TTATGG AAATGT ACAAJLATGGAAA-C AATTC
TGTGCTTGA^GCATACAJVCAGACAGACAGAAAATTCATCAAATCTTAGAGATGCT^ 
AAGTATACACACACAATTCA.GGTTTTTCTTTCATACCCACTGGTAACACAGCAAGTGGT 

AATGGTG ACC TGTTC AGTGTG AC AT ATCTTAG AAGT ATTTTAAG T AGT ATTT C TGC TGC 
TTTTCC CTCTCACAACAATACTGGCTCAAGTAC AGTTATTAC TTCAAAACTC ATTAAGG 
ACCCAAGACTTATGAAGAGAGAAC^ 

AGTGATGTTTTGCCATTGGATAAGAGTTTGGGTTGTGGTGATTCAC^ 

ATGTATGC CAAC T AGTTCCATC TCTTCAT CGGAAGTTCC TGC TG ATAATACCATTAC T A 

GTTGTTTGAATGCCTCTTGCTTCAAATTCTCTTCT 

AATAGCAGCTC AAAGGGCCA.TGACTGTATAGCATC CAGTAGCATTGCTGTTAC AGAACA 
ATTTAAAGAGCAACACAGTTCTTCCTTCCCC^GTTCTTTATCAAATGCATTTTCAGATG 

TC^*JLACAAAJ\ACA^ 



WO 01/66752 



5/108 



PCT/US01/07371 



Figure 7b 
CCAGTTCTAACAGCTCTGAG 

TAGOJ\CGACT CTC\GGGTC ATTTCTCTCAAGAGTC AC CATCTTC TGAT AT AAACAGTA 
TATATAAGGTTGGTCACCL^GATGTCTACAGTCTTCCCAGCCCAGAAGAAAGGAAATCTA 
TGTGAATACATCCAAGATACGGGAATGATGAGAGCCTCCATCAGCACAGAAGACAGC^C 
TAAAJSATGGAGTAAACCATACTTGGTGCAAAGAi^^ 

GTAGC C CAATTGAT AATT C C AATACATTGTAC CAGGAAC ACAAAGAAGGAGGAAATCTT 
AATTC TTT AAGTGGTAATTGTGAAAAAATCGGAGTTAC TCATAAGTTAC AAGTGCC CAA 
GTTTCCCATATCTTCC^CAGGGGATAA^ATGAACTATATCGTGCAGCATTGGAATTAG 
AGTGTTCTC TTACTCC AAC TATAGAGTGTCTTTC AC AAAAGTACC C GC AAC ACT CTTTG 
GAGCATGAAGATAATACAAATTTTGCCATGACTCAAGGGCTAA^AGAATTAAAAACAG^ 
ACAAAATAATCAGAAC TTTGGTAAC ATTTTGTCTGATGCCTTC CAGG AAGCAAAAGATG 
TTCCCCTGGCCAGTGAAAAGCTCATTGATAGAGTTATTTCATCAGCTGCCATTGACATC 
TCTCTTGAGAGTTCAGTTTGCAACATAATTGGAGAATATACATGTGTCCGGAGGGAAAA 
TGAAAATGGGGAAGCATCAC GATATAACTGTCACAAAG AAGAAGC TTC TC GTGTTAAAG 
ATGGTGTGCAGGATCAC\GCCTATCTTATGATGCAGAATTGAGCTGTGATC"GAACTTG 
a J^ATTAACTTGCAAGAACAAAGAGATGATAAAAATCCAAACGAGGC 
TACAGATNACATAAATGGAAGTGAGAAACAAGATTGTCTTGCAAA 
ATATAGTTGAAATGAGGGAAA?TAAGAGTAACACTGAAGTAGAAATTCTGAATTCTGAA 

GMTGTTTCAC ATTT AACT C ATTTCGGGG AAAAAACGGTAAAC CAGCAG AAAC AGC ATC 
ATCAGAGAGTGAAGCTGTAGAACAAAGGCATGCACCAAATGATGAAAGAG^ 
ACTTGGTGTCCACATTTCCAGAAATTGAAGGCTCTTCAGTGTGTC 
ACAAAACAAA.TAGTTGGCACTACTGTCCTTACLAGTAA 

AAAAGATGAGTTAAAAGAAATTTGTTCCTCTGAGAGTTCAGATTTGGGTTTAGTAAAAC 
ACAGCATTTCTGAATGTGAAATTGAT^^^ 

TTGGTAAATGAGAATTCAGCTCTTAAAACTGGATTGGGAAGTGAAATTGAGGTAGATCT 
TGAACATGATAATGGTTCTGTATTTCAAGAAAATATGCAT^ 

GTGAAGAATTTGAGTTATATGAGTCTCTAAAGTCTCGGATTGATTGGGAAGGCCTGTTT 

OTAAGCAGTTATGAGGAAJ^TAGAATCCTCA^ 
GCATAGTTCTACAG^ 

ACAACCCAATTTTTCTTCCAGATCTAC>J^GTTACAATT^ 
ATCAGTCCCACTGATGAATCTTTAGAGTTGAAAGATAATTTOT 

ATCTACAGAACCAGAAACAAATAAGGMGGGAATGCTTCTGGATTTGGCATGTGCTCCC 

AACCTTCTGGAGAAAATTCAAGTTTTTCATGTGCAAATAAGT 

GAATCAGGAGATGTGAC^AAGTCTGAGAGTO^ 

ACATGTGGATCAAGGATCTGGAAAACCAAACAATGACTCTTTGTCTACTGAACCATCTA 

ATGTCLAGAGTAATCAATGATAAGAGCAAAT^ 

GATACTAGAAATAAAAAGGACATGCAATC^ 

TTCTT CC AGGGGTCAG AAC ATAGC C^T AAAG ACTT AAGGG AGC ATG AAAC TC AC GAGA 
AGAAGAGAAGGCCAAC AAGC CATGGCTC ATCTGACCGTTTCTCTTCCTTATCC CAAGGA 
CGAATTAAAAC ATTTTC GCAGTCAGAGAAGC AC ATTAGGA^ 
TAATGAAGCATCTTTATGTAAAAGCAAACATCTGTCCAGGAAA^ 
TTCACTTAAAAAAAGCCCATAGAAGAGTTCATACATCTTTGCA^ 
GGACAAAAGAGGAAGGGCCCATTACCAAAAGCATATGCAGTA^^ 

GGAAAGTTGTGATCATCAAGGTGATAGTTTGATGTCTGAAAGAAGATATTCTAAGCATT 
TTTTGTCCAAAJ^GAAAATATGACAGACAGGGAGATAAAAGAT 

GAGGAGTCATTGACCCCGGTA.TCAAAGCACAGATTATATAGAACAAACAGA.GAGAGGAT 
TGCAGAGTGCCTTTCTAATGAAGTCATGTCTGGGCATGTTTCCAGTAGTCTTACCACTT 
TCC ATGTGAG AGAATTTTGTGATGAAGAJICAGTTTCCAG AAC CAC AGTT.ACC TC TAGCT 
Figure 7c

TATACATCTCAGAGTATAAGTCA^ 
CTCTTCTGAACTTGAACATTTT^ 

CACTAACTGAAAAAGAATATCAGACACATACA^^ 

cttaaaaacc^tacaacacatagtattagggatatagcaaaagaatgtaattctgagga 

taaaac^gttctctgtgaaagcaatccagtgtatttaagtttcataaaag 

gtcatagtccagataaaagttatgattcaaat^ 

tcagttttaggctccaaj^aaagqcattttaagtgtt^^ 

ttgtgt atctgatggtgt^aaaagtgg agaagcaatttttcctatag^ a aagtgtacag 

ttcctatggagaccacatcaagc^ttccta^ 

attcctccggtctcatcaattctagtgacagctggagaggaagaatcttctgtagggga 

aaatggactcttcgatgtaaatgagaatgagatgaatattactatgcattctaaattag 

atctaacatcagtajictgaagaaagtaaaa7ttgtaagaaaaatatgaagaa 

tccaatgatagttctatgctattaaaggagaatataacgggtccot 

ggcaaaatac attgaggaj^gaaaaaattaggaaaattg agc aagcagtttacaaaa^ aa 

ttatt actg aagg atcacctattagttttaag tacaaaagtcaaaataagatc ctaaag 

gaaaaatcatttcatgttaacaagaaaataattacaaacaacttgactgattctc 

aagcattaaaaattctactgtagacacaattgc tttgaaag ac attc ct aatc agc tta 

aagaaagaaaggaagcagggcaaattaaagttaataacaactctcactctgactgtctc 

tccaagccagccattgtagaaacta^tcataggcctgttttacatgggaaccctaaagt 

tgctac tcttcag aaggaattaaaag a^c atc gc tc acc t aattacacatc tc atgt aa 

cagaactgtctcaaattttacagagagcagatgaagcagcatctcttcagattt'tagaa 

gaagagactaagctttgtcaaaatattctccctttatttgttcaagcttttgaaagaca 

gcaagaatgttcaattgaccaaatcctgatttcaagaaagct 

tgtggaataattgtagacttaaattgaaaccatgtgctgttcat^^ 

cagatggcaatggaaactattcaatttattgaa 

accaacattccgaagcttgctttggtatgatgagagcttgtacagtgaactgcttcgca 
ggccacgtggatatcaactgcagtccaatttctaccctggttttcaaggacgactaaaa 
tacaatgcattctgtgagttacagaattatcataatcagttagttgaattgt? 

"~ "aaclaaaaaaagaaaata^ttcatattacgc^ 

aatgtgaagccataa.tgaagc^ctattctgattcctttgacttttgtccttctgttcca 
tttgc c tgtggag ttaactttggagatagtttaggagacctggaaac c ttaag aa aaag 
cactc tgaagc tgatcagtgtac c tgggggc tct c c taaagt ccatt cc t acc c a.ggaa 
agaaagatc atttg tggatcattatagaaat agtctcc tcaaaggttagttttatcaag 
agcaatgaagaaataagtatcaaaatctgtcttta7 
tgctgcaaaaagtcttgtatggaaagaaaagagctgctctt^ 
aaaagaatagagaaatggaggaaataaatgagagtgctttt^ 
tatgatgtcttatctaaaggtttaaacaatgaacccactagtattgga 
tgctattattgcttccaaacaatccaci^ 

aagcttggc tttcatatccagatatttcttgtgttggtgagatactggatca agctaaa 

tctgc\gacctagaggagtta(l\c^cctcactc^ 

tttaaaaaaatactttcagatgctgc^gaagataac^tagata^ 

AAGAAAATGTTTTGGATATGC TAAGC^CCAC^lCCTGGGAGC AGTC ATTTT> A AGCCT 
GAAGCTATTGAJ3ATTTATATTGAAATTGTCATGATCTCAGAAACAATTCA 
AAATTTAAT AGC AAAG AAAC TGCAOAC C AG AG ATTTCGAGG TATGC TC TGG TTCGATT 
GGTCTCTTCTTCCTGAGCTAJITTGGCTGCC^ 

GACACCCA4ACACATTGC C TTTGGAAACTGGTAGAGACTGCAATTTC TGT CCTT AAGAA 
AGAGCTGGCTGTTATCTATGAATATGGTGAAGCTTCTAACTGTTCCTATGCTCTACA.TT 
TATTCTACAGAGA^CTTAAC^-AJ^ 
Figure 7d

tattcagtttcc\cgtatattgacttggtgccac a .tactgcatctgta^ttttgga^ 

cactgtggcagaattagaacataactacaagc^gttttttctattactcaaaaatgtaa 

tgtctgtccctcagaj^gattttggaaaaatggttcatattataaa^ 

attgaacatatgaagctgctaagtgctaaagatactaaattgtccactcatcttctctt 

tctccaaatgctgcgcaacaaaaggaatgct^ 

agac^cccgttacagaacctggggaggacagcagtcaacctggggtttctgagcagaca. 

cctcc^ggtac ag agtgcacagtaaaaaac atttcagactcc tctaaaaagcgac ctgt 

gactgcagacacatgtgaagtctctc\ggga*^^ 

ggaaa^aacaaaaggttaccatgaaagatgttggaaacatacag 

ccaagcactacaggatctcctcccaatgatgaaaacaaaataggatcaaattcctctga 

cagtctgaaaagcatctctgCatctccagaagtggtcaaaagacagagctcagtacttg 

gttcagtgtcac ctgctgaaagtgtacaag acacttgcacac c aaagtcagaaaggaaa 

gtagagccaac^acagcttacctgattctt^^^ 

ctcaaatgtcatagagaaaagaaatgggaattc^ 

aagactgtccttt agtaacttgtgac caaaaggatatagatgcc tc ttactcacc tgag 

cacacacctgcac aggagtcc cataaaacccctgtggatcacacacagatc tctccttc 

aaacctaacagcaggaaatgatgaccctcttgtgcctgatgcatctctgctctcagtgt 

ctgcttccgagtcagagaaggacgtttatttgagtggcacagactttc^ccatgaaaat 

aataaaatactaaatttgtctactgaagattgtacaggcaccagctctccagaacctgt 

gtgtatcaaggacaaaatttc tgtcctgcaagtagataaaacacagc ctataaaaagtg 

aatcgcc\aaaaaaagtatgactgatgctccaaatctcaatactgcaccatttggctca 

tatgggaactcagcccttaatgtgaatggaacagtacagcacactcactctgaacagaa 

ttcaaaagtcc tgactc^gaaagttggc ccatccaggaatatacctc c^cagtctgcat 

gttctccagtacataattcttctgcaca.ttcatttggaacttcatatccatactac 

tggtgtttctatcagtacagcagcagca^ 

gatgacaac^tatgagatacaaoigcctcctcctccagtgttgactacagttgcaagta 
ctgttcagagcacacatttca^tcgttca.tactctgaacattttagttactttcctgga 
- -cagccaca&gcaaattgctt^aaccga^ 

ttcttacaattac caacaaccagtttattcacagtttgcttctcatcaaccagtc ccac 
aggctacatatccttatccgcctaacccaggtgtgcctcctcaagttccttggacttat 
gctccatggcaacagaatccgtttctac gaaggc c ttaaaaataaatctcttcatac tg 
a aataa^-t gcaacttaagttttctca^gt aaaaaa 

Figure 8a 

SEQ ID NO.:8 Spg5 encoded protein sequence 

MTYFFIYVSTERACSLNNCTIAKRIGKGXDAWI^ 

GPF S SDTS S S YGNVQNGNNSVLZAYNRQTENS SNLRDASQVYTKNSGF SF I PTGNTASG 
NGDLF SVTYLRS I L S S I S AAF P SKNNTG S S TVT T S KL I XD P RLMKREQ SMINK SD TAGL 
SDVLPLDKSLGCGDSQIKLTCMPTSSI5SSEVPADNTITSCLNASCFKFSSESSHYQAK 
NS S SKGEDC IASS 51 AVTEQFKSQHS S SF PSSLSNAFSDVRKQKKSEEQVQRAQMRSNV 
E^TALSSESRNSDESSNTCSNDSQGEFSQESPSSDINSIYK^/GKQMSTVFPAQKKGNL 
CEYIQDTGMMRAS ISTEBSTKDGV1SIKTWC3CETVL SNETVSSP IDNSNTLYQ EKKEGGNL 
NSLSGNCEKIGVTHKLQVPKFPISSTGDKNELYRAALEIiECSLTPTISCLSQKYPQHSL 
EHELNTNF AMTQGL I ELKTVQNNQNFGNI LSD Ar QEAXDVPLASEKL IDRVI S SAAID I 
SLDSSVCNIIGEYTCVPilENENGEASPYNCHKEEASRVro 
KXI^QEQRDDKNPNEAKEKNTDXINGSEKQDC 

ECFTFNS FRGKNGXP.^ETAS SESEAVEQRHAPNDQRGL EHLV STF ? E IZGS SVCVASNA 
TKQ IVGTTVLTVS T S LGDHQKDELKE IC S SE SSDLGLVKHS I S ECE IDTDKDKLQDFHQ 
LVNENSALXTGLGSEIEVDLZHD^^ 
GSSYEEISSSSFARPJEGTDQKSSTECNCVSFCSQDKRELKNPIFLPDLQ^/TITNLLSLR
ISPTDESLELKDNFYKQWESTEPETNKZC^ASGFGMCSQPSGENSSFSCANKFGWSVQ 
ESGDVSKSESSKSSNSSKNTHVDQGSGXPNNDSLSTEPSIW^ 

DTRNKKDMQ SRS SKRTLKAS S SRGQNIANKDLREEETHEKKRRFTS KGS SDR? S SL SQG 

RIXTFSQSEKHIRNVLNILNNEASLCKSXHLSRK^ 

GQKRKGPLPKAYAVII-SJINrWESCDHQG 

EESLTPVSKHRLYRTNRERIAECLSNEVMSGHVSSSLTTFHVR3FCDEEQFP 
YTSQSISQLEYTNSIVGNESSSELEHFSETSGNMLDPKETLTEKEYQTHTQLCNSDSAK 
LKNHTTHS IRDIAKECNSEDKT^^ 

SVLGSKXKHIL S VD I YEQDNCVSDGVKSGEAIF? IEKCTVPMETTS S IPTENIASKSYT 
IPFVSSILVTAGEEESSVGENGLFDVNENEMNIT^ 

CNDSSI^LXENITGPSKRYMAiCYIEZEKIRKIEQAVYKKIITEGSPISFKyKSQ^ILK 
EKSFHVNKXIITJ^TDSm^ 

SKPAIVETNHRPVLHGNPKVATLQKELKEHRSPNYTSHVTELS^ 
EETKLCQNILPLFVQ AFERQQEC S IDQ IL I SRKLLVEQNLWNKCRLKLKPCLAVDTWVEL 
QMAMETIQFIENKKRFLEGKPTFRSLLWYDESLYSELLRRPRGYQLQSNFYPGFQGRLK 
YNAFCELQNYHNQLVEFLTETKKENNSYYALLKYKRQ INECEArMKHYSDCFDFC PSVP 
FACGVNFGDSLGDLETLRKSTLKLISVPGGSPKVHSYPGKKDHLWIIIEIVSSKVSFIK 

SNEEISIKICLYGLEHIYFDAAXSLV^^ 

YDVLSKGLNNEPTSIGLQEDAIIASKQSTLGSISNCRLNKAWLSYPDISCVGEILDQAK 

SADLEELQGLTLRCTDHLEILKOTFQML^ 

EAIEITIEIVMISETIHYLKNLIAK^ 

DTQTHCLWKLVETAISVLKi^^ 

YSVSTYIDLVPHTASVNFGNTVAELEHNYKQFFL^ IKVMKT 

IEHMKLLSAKDTKLSTHLLFLQMLRNKRNA^ 

PPGTECTVW^ISDSSKKRPVTADTCEVSQGKGOTT)TVP 

PSTTGSPPNDSNKIGSNSSDSLKSISASPEVVKRQSSVLGSVSPAESVQDTCTPKSESK 

VEPTDS LPDSLASLTEQQENSNVT EKRNGN S SVAJlTNDKiQDC PLVTCDQKB IDAS YS PD - 

HTPAQESHKTPVDHTQI S P SNLTAGM)DPL VPDASLL SV SAS Q S EKDVYL SGTDFHKEN 

NKILNLSTEDCTGTSSPEPVCIKDKISVLQVTDCT 

YGNSALNVNGWQHTHSEQNSKVLTQKVGPSRNIPPQSACSPVHNSSAH 

WCFYQ Y S S SNGT AVTHTYQ GMTTYE I QQ P P ? P VLTTVAS TVQ S THFNRSY S EHF S YF ?G 

QPQANSFNPGNGYFPSHTPVSYmQQPVYSQFASHQPVPQATYPYPPNPGVPPQVPWTY 

APWQQNPFLRR? . 

Figure 9a 

SEQ ID NO.: 9 Spg13 cDNA sequence

AGCAATGGCGGCAGAGGCTTCGTCGACCGGGCTGGCTTCCTGTCACCTAGTGGAGAGTA 
AGAGTGGAGC GC AGGGTGC CTC GGGGTGTC AGTGCACTCGGTGTGGAAGG AAGGTGTCC 
GTTGCCTCCGGTGACCACCACAAGTTTCCATGTGGACATGCCTTTTGTGAACTGTGCCT 
GTC AGC AC CTC AAGAATATAC C AC AAGT AAATGC\C TGACTGTG AGGTTC ATACAAC TG 
TC^GCATGAATCAAGGTCACTACCCAGTAGATGGCTTCATCGAGGAAGATTCTTCTCTG 
GAAGCCTTGCCACCGAAAATGGTAAATAACTGCTCTTCAGATCTTGAAAAGACAGTGGA 
CCAGCTAATTAATGATTTAGAACATTCATCCTCCATACATAGGAATGTTTCAAACCCAT 
CAGCTGTAATGTCGGAGACAGAAGAAATTGATGAAGCACTGAAGATAGCAGGCTGTAAT 
TTTaA^CAATTAAGTAATGCTATAAA^TGCTTGATAGCA 

AGAGACACACAGTCTAACAGAGGCTGTGGAGAAACAGTTTGATACACTTCTTGCTTCTC 

TTGATTCCAGGAAAAAGAGCTTGTGTGAAGAACTTATAAGG^ 

TCAAAATTAGTAACAGTTAAAAGCTA^^ 
Figure 9b

TATGAAGATAGCAAAAGAACTCAGATCTGCTCCTTCTCTGAGGACCTACTGTGACCTGA 
CTCAGATTATCCGGACTTTGAAGTTAACATOT 

ATAATTCCAAGGAACA.CCCCTAGGTTGGATATAAATTGCAGTGAGGCCATCTGCATGTT 
CL^GCAGTATGGGAAAGATTGAATTT^ 

AAGATGGACAGAATGTTCAAAJ^GAAATTTAATAATAGAi^AGGAACTCTGTTGTGATGTA 
TACTCATCACTAGAAAAGAAAAAGGTAGATGCTGCTGTCCTGACTGATGAAACACCTG^ 
ACCTCCTTTGCAAGCAGAGGCCCCTGACAGGCATTTAGAAGGGAAAAAGAAGCAGCCAA 
CAAAAGAGATGGTTGTGGTGACATCTCCTAAGACTATTGCTGTACTGCCTCAACTGGGA 
TCCAGCCCTGATGTGATAATTGAGGAAATTATTGAG^ 

AGATGATCCTATAGAGACTTCTGGATACCCAAAAAAGCCCCCTCAGAAAGAGCAGTCTG 
CTCCTGTTGGATCAAAAGCAGGTTGTCCAGAGCTAGTTTTTGTAAGTCATGTAATACAT 
CCTTGCCACTTCTATGTGCGGAAATATTGACAAAT^^ 

GAAGATGAAGCAAGTTTGCAATAGGAGCTTACACCTTGATCCTTCAGACATTTTGGAAC 
TAGGTGCAAGAJITATTTGTC^CAGTATTAAGAATAGAATGTGGTGTCGAGGAATTATC 
ACTGAAATAATTCCATCAAAJVACTAAAAATATTAGAA^ 

CTCAGTCTGTGAAATTTCACTAATA(1\GATATTCATGGTAGATTTTGGAAATTCTGAAG 

TCCTGATCATCACAGGAGTTGGTGACACACATGAGGGACCAGAGCATGATGGTGAACAG 

C^TATTACACTAAGTGACTTCTGTCTGCTTCTAATGAAGTCTGAACCATACAGTGAGGA 

ACTGTTGAAAGACATCCCACATTTAGC^CCTGTGCTCCTTGAAAGACATCGTCCCAT 

ACAATTCAGTAAGTGAGAGAGAAAGTGATTCTCCCTCAAAGGCTGTGGAGTTTTAGTTG 

TGTTGCCAGTGAAGTTGCTGAGTTGGTCTGAAAGAACCAGTTAGCCTTGACAAACACAT 

TCTGATTATGAACTATTAGTGCCAATAAAAGTTGCCATAAGCCTCAGCTTCCATAACTG 

AAAATATTTGTAATGAAAATTTGAGCTCAA^ 

CAAGTAAATACCACAAAAAAAAAAAAAAA 

Figure 10 

SEQ ID NO.: 10 Spg13 encoded protein sequence

MAAEASSTGLASCHLVESKSGAQGASGCQCTR^ 
APQEYTTSKCTDCEVHTWSMNQGHYPVIXS^ 

LIHDLEHSSSIHR]WSNPSAVMSETEEIDEALKIAGCNFEQLSNA.IKKLDSTQ 
THSLTEAVEKQFDTLIASLDSRITCSLCEELIR^^ 
KIAXELRSAPSLRTYCDLTQIIRTL^ 
SMGKIEFEDSTKCYPQENEDGQNVQKPGFTJNRK^^ 

P LQAEAPDRKLEGKKKQ PTKEMWVTS ?KT IAVL PQLGS S PD VI I EE 1 1 EENLESCFTD 
DPIETSGYPKKPPQKEQSAPVGSKAGCPELVFVSHVTHPCKFYVRKYSQIKDATILEKK 
MKQVCNRSLHLDPSDILELGARIFVNSIKNRMWCRGIira 

VCEI SL IQIFMVDFGNSEVL I ITGVGDTHEGPEKDGEQHITL SDFCLLLMKSEPYSEEL" 
IJODIPHIJ^CSLKDIVPYNSVSERESDSPSKAVEF* 

Figure 11a 

SEQ ID NO.: 11 Spg14 cDNA sequence

acgcgggggagtccgcacctgtggctgttgggtccgcggctatggcgccaaagctctga 
agccttzaacggctttctcgcctggctggtgggctctctccgagctgagggccatctcct 
tcgattccaagrgtgggttccggcccagtggacccctcngctcaccATGGCAGAACCTG 
CLAACTGCAGAAGGAACTTCTGGTCTTGGACA 

ACAGGGGAAATGGAGCCTGCCACTGGAGTGCAACTTGCTGGTTCTGGAGAGCTGGTTGC 
TGAACCGGGACCCTCCAGTACAGAAGC^AGGGAAAATACAGAAGAGGCGAATACCATGG 
GGCAAACAGCGAATGAAGATCATTTTGACTGGGACAAGTACTTGAAA 
Figure 11b

ATGAGTGCTCCTTCTGAATATTTCAGACA 

AATTGGTATGAAATTGGAJVGCCCGTGACCCTCGCAJVTATTGATTCGGTGTGTGTTGCTT 
CGGTCATTGGAATTACTGGAGCCAGGTTACGTCTACGACTGGATGGTAGTGACAATAAG 
AATGATTTTTGGAGACTTGTGGATTCATCAGACATACAACCTGTTGGGACGTGTGAJ^CA
AGGAGGAGATTTACTTCAGCCTCCACTGGGGTACACACTGAATACTTCATCATGGCCCA 
TGTTCTTACTACGTGTACTAACTGGATC TGAAC TGGCAC CTGCAGTGTTC TTTAAGGAG 
GAACCACCACGTCCACTCCAAAATAATTTCATAGTTGGGATGAAGATTGAAGCTGTCGA 
TAGAJ^AAATCCATTTATGATCTGTCCTGCCACAATTGGAGCTGTCTGTGGAGATCAAC 
TTCATATCACTTTTGA-TGGATGGAGTGGAGCATTTGATTATTGGTGTGACTATGACTCC 
CGAGACATCTTCCCAGTTGGATGGTGTCGCCTCACAGGAGATGTATTACAGCCACCAGG 
AAAAATTGTTGAAAAAAGACCAAGGC GCAAAAGAAGAAC CAGACT ATGGAGACTTAGAA 
CTGCTCTTTTGGGAAATGAAGAAGAGGCCCCAGAAGCTGCAGAAGAACCTGGGACCAGT 
GTACTTACT-TTTjK AGATGAAAAC^GAACTTTGAAAGATTGCC GAGGAGAAGC TGCAGA 
AGAACCTGGGACCAGTGCATTTACTTTTGGAGATGAAAACAGAACTTTGAAAGA'ITGCC 
AAGGAGGTTGGAAAAAJ^CCCAAGGGC^GAGGATTCATCAAACCTGGAAAAGATGAAACC 
AGACCTGGGAAACATGAC CAAGGAGCCC CAGCTGGGAAAAAAC CCAGGGGC AGAGGATT 
CACCCAAC CTTTGGAAGATGAAGC GAGACC AGG AAG AGATGTC CAAGTAGCC CCAGCTG 
AGAAAAAACGCAAGGGCAAAACAGTCAC CACACCTTGGACTGAGGATCC CAGACTTTTT 
GCAGATC AAGGAGATG CC C CAGCTGAGAAAAAACGC AAGGGC AAAACA.GT CAC C AC ACC 
TTGGAC TC AGGATC CC AGACTTTTTCCAG ATC AAGGAGATGC 

GCAAGGGCAAAACAGTCLACCACACCTTGGACTGAGGATCCCAGACTTTTTGCAGATCAA 
GGAGATGCCC CAGCTGAAAAAAAACC CAA.GGGCAAAAGAGTCACCAAATCTCGGAAAGA 
CC AAGC C C AGTTTTTAGCTGATGAAGAAGCAATGC CAGC TCTTTTTTCAGCTCTTAGTG 
TGA.GCAGTACAGAGAGAACACCACCTTCTTCCTCTGAACAACCAAAGTCTTCTACCTCT 
GGGAAAACAAAATCCACCTCTAGAGGGGCTCAAACTTCAAGGAAGTCTCCACGGAAAAC 
AAGTGTTGTGCAACCAGTACCAAAGACCAGC^GAAAGCAGGAAAJ^TCTAAGTCTACTG 
GAAATACTTCATCCCCTAAGAAGGGCAT^^ 

-GGTGGAAAGTCTGGAAAAJ^GGAJUULAAGTATTC 

TTCTTTAJ^GTACACTGATGAAAAGTTCTTCATCTAA-TAAGACTTCTGCGGGGCCATCTA 
AAATAGTGATGTCTACAGTTTGTGTGTATAT^^^ 

CTGGATCCACAGAAGGTTCAGCAGCTACCTAACCACTTCGGTCCAGGCCCTGTGAATGT 
CATTCTCCAGCGGACTGTGCAGGCCTGTGTCAATTGTGCCTTTCAGGCCAAGGATGTGT 
TTCTATTTCTTAAAACAGATAATAGAGGAGGAGAAATGATAACTGCCTTCTTTGATGGG 
AAAGTTCLATACTGTTCAGCTCCCTCCA.GTGA^TAGTGCATCATTTGCACTTCGCTTTC^ 
TGAAAATTTCTGCCAAAGCCTGCAGTGTGATAACTTTTTGAGTAGCCAGCCTTTCAGAC 
GTGAGGCTCAAGTTCCTACCCCAGATACAGGCACTGA.TCAAAGCAAACCA.GAAAATGGG 
GAACCAAAGGAAAAGAGAAGCCTCAAACGATTGTCTCTGCATCCTCATCGTTCTGCTCC 
TGTCTCATCTAAGGTTCCC AGAAAGTCTGGGCAAGCGTCTAAAGGAAATTGA t gggaa a 
gccctgctactactgaagagrgagctgctgatgaagtatatggggctgaaactaggacc 
agcagtaaaactttgttattacattgaaaaacttaaagaaataaaccataactgaaaat: 
gtgcaaatrtagtttagagataatucccaggratactgaaaacattttactttaatagt 
agttttcazctcacccgtcttcacctuatagaatgttttzatagaaacatttuacgaaa 
gttgtagcacattaacagtatactcttautccttaaaLtccatctaatacgccugtiatt 
agcatga t uaaaac tggc- 

Figure 12a 

SEQ ID NO.: 12 Spg14 encoded protein sequence

MAEP ATAEGTSGLGQQ VTK31GKPSTGEME? ATGVQLAGSGEL VA£?G? S S TEAR2NT22 
ANTOGQTANEDHFDWDKYIJCHTGSM^ 
VCVASVIGITGARLRLRLDGSDNXNDFWRLVDSSDIQPVGTCEQ
S SWPHF LLRVLTG S ELAP AVFFKZE? ? R2 LQNNF IVGMK IZAVDRKNP FMI C P ATI GAV 
CGDQLHITFDGWS GAFD YWCDYD SRD IFPVGWCRLTGDVLQ P PGKIVEKRPRRKRRTRL 
WRLRTALLGNEEEAPEAAEEPGT SVLTFGDENRTIjKDCP.GEAAEE PGTSAFTFGDENRT 

lkdcqggv^<pkgrgfix?gkdetrpgkr3)qga?agkxprgrgfto 
vapaekkrkgktvttpwtsdfrlfacqgdapaekkrkgktvt 
aekkrkgktvttpwtedprlf adqgdapaekkpkgkrvtksrkdqaqfladeeampalf 
salsvsstertppssseqpksstsgxtkstsrgaqtsrksprktswqpvpktskkagk 
skstgntsspkkgitxkr^pkkxggksgkkeksipvisstssaslstimssssl^ts 

AGPSKIVMSWCVYINKrlGDCGPFLDPQKVQQL^ 

AKDWLFLKTDNR^GEMITAFFDGKVETVQ 

QFFRREAQVPTPDTGTDQSKPENGEPKEXRS 

Figure 13a

SEQ ID NO.: 13 Spg15 cDNA sequence

CAGC^GTGACTATGGCATGATTGACGACTTGATC tac ttttcc aatgacgctgtgacga 
gtacagtgcttctgaacgtgggacaggaagtcattgctgtcgttgaagaaaacaaagtg 
tc\aatggactgaaagcaatcagagtagaagc?gtctctgacaaatgggaagatgatag 
caaaaactctagcaaagggttgtcagactccagccccagagtgctgattggctgtgtga 
cttccatgttggaaggtgctggctatatcagccagaccacatacttctctttggagagt 
gtgtgtgaaggtttccacccatgcaagggtgactgggtagaggctgagtattggatcag 
gccagggacatggagcagtgaggcaatctctgtgaagcctctgaggtacaagcgtgtgg 
acaaggtttgca.tttccagcctgtgtgggaggaacggggtgatagaggacagcatcttc 
ttcagcctggactccttgaagctgccggaagggtacataccgaggagacacgacattgt 

CAATGCTGTGGTTGTGGAGAGCAGCCAGTCATGCTACATCTGGAGAGCACTGTGCATGA 
CCCCTGTGAAGAGAGATGCCACTCTTGGTGAGGCCCCTCAGGAGCCCTATGGAGCACTC 
TTACTGAAAAACAAAGGGG AC ATTGAAGTTAC AAG AATGAC CAGTTTT GGAAC ATTGAA 

GGAAGGAGAAAGGAAATGAATCGTGATGTGGATAGAGAATAAAG^ 

AGCTTGTGAGTTGCAGACTGGCTAACTGGGA?AAAGCACACCLA.GTT^ 

CAGGGCAGAAGCAAGTCCTGCCCAGGAGCGGCTGCTGGGTCTGTTCCTGAAGGTGAAAA 

TGTTAATTCATTGAATCATC\CAGAGA^GACAAAACTGATGAGATTCC\GAGAGCCGTC 

TGGCGAACAGCACAGAAATCTCTCCAGATGGCTGCGCTTGTAAAGAAGAAAGTAGAGAA 

AAAGGAAACACGCCAGAGAAAC AGGAGCCAGAGC CTGGGGGGCTCATTC CTCCGGGGGA 

GAAGACTCAC ATTGTGGTCAC ATGCAGTGCCAAAAAC CC TGGCCGTTGCAAGGAGC TGC 

TTCTGCTCTGTTTCTCCGACTTTCTCATTGGGCGGC^TCTTGPAGTGAGTGTGGTGAGC 

AGC GAGGAGGC C CTGATAGCTGTGC GTGAGCC GTTTTCTTGG A^G AAGC CTAAAAGC TC 

CCA^J!1ATTAGTGTCTGCAAAGACTACAGTTGTTGTAACCACACA^ 

GGCGAC AACTTC CAAGTTTTCTTCC AC AGTATC C AAT AC C AG AT AG AC TT AAAAAATGT 

GTGGAGCAGAAGATTG ACATC CTGACTTTCCAGCCGC TTCTTGCAGAGC TCTTGAAC AT 

GTC^ACTACAAGGAGAAGTTCTCCACCCTGCTGTGGCTAGAGGAGATCCATGCAGAAA 

TCGAGCTGAAGG AGT AC AAC ATG AGCAG AGTTGTC C TC AAGAGGAAGGGGGATC TGCTG 

GTCCTGGAGGTCCCCGGGCTCGCAGAGAGCCGGCCTTCCCTCTA'TGCAGGTGACAAACT 

GATTTTAAAATCTCAAGAATACAATGGACATGTCATTGAA^ 

AGATTCATGAAGAAGATGTAACTCTTAAACTTAATC^ 

TTTGAACCTATGGATGTGGAGTTTACATACAATCGGACCACAAGCAGACGGTGTCACTA 

TGCACTTGAGCAGGTCATCCATTTGGGTGTAAAAGTAT^ 

TACAGTCTCCTCAGGTGACAGGGAATTGGAGCCTTGCACA 

CAGTCCATCA.CCAACATCACCAGAAATGATGGACAGTCCATGACCAAGGTAACC\GAAA 






Figure 13b 

TGACAGCCAGTCCATCACCiLiCATCATCAGAAATGATGGACAGTCCATCACCAACGTCA 
C CAGAAATGACGGGCAGCCCATC AC CAAGGTAACC AGAAATAACAGCC AGTC AATCACC 
AACATC ACCAG AAATG ACGGGC AGC C CATCAC C AAGAACAAG AAAACLAGTGAAGGACCA 
AACTAAACA.C a .CAAC^GAGGAAAGGCACGTGGGTACCACGGACCAGCCAGAGAAGGCOT 
CCTCCACTGCAGAGACTATGGATGAAATCCAGATCCCAAAAGCACGAGATAAGGAGTTC 
TTCAACCCAGTGCTCAATGAAAACCAAAAGCTGGCCGTGAGGAGGATCCTGAGTGGCGA 
CTGCCGGCCTCTCCOVTATATCCTTTTTGGACCTCCGGGAACTGGAAAGACTGTGACTA 
TAATCGAGGCTGTTTTGCAGGTACATTATGCTTTGCCGGACAGTCGGATTTTGGTCTGC 
GCTCCTTCCAACAGTGCTGCTGACCTTGTGTGTTTGCGACTTC\TGAGAGC a AGGTGCC 
GAAGCC^GCTGCCATGGTCCGGGTGAATGCCACCTGCAGATTTGAAGAGACTATTATTG 
ATGCCATCAAACCGTATTGCAGAGATGGAGAAGATATCTGGAGAGCCTCACGCTTCAGG 
ATAATAATCACTACATGTAGCAGTGCAGGACTGTTTTACCAAATAGGAGTGAGAGTTGG 
ATACTTCACACAJGTATTTGTGGACGAGGCAGGACAGGCAAGTGAGCCAGAATGCCTT^ 

TTCCTTTCGGACTGATTTCAGACATCAA 

CAGCTCGGCCCAGTC\TCAAGTCC^GGCTGGCCATGGCCTATGGGTTGAATGTGTCC^ 
GTTGGAGAGGCTGATGTCCAGACCAGCGTACCTGAGAGACGAAAATGCCTTTGGCGCTT 
GCGGTGCATATAACCC^TTGTTGGTCACAAAGCTTGTGAAGAACTACAGGTGCCACTCG 
GCTCTGCTGGCACTGCCCTCACGCCTGTTCTACCATAGGGAGCTTGAGGTCTGTGCTGA 
TCCCAAAGTAGTGACTTCACTGCTGGGCTGGGAGAAGCTGCCCAGAAAAGGCTTCCCTC 
TCATCTTCCATGGAGTGAGGGGGAACGAGGCTCGTGAAGGGAGAAGCCCATCGTGGTTC 
AGCCCAGCCGAGGCTGTCOW3GTCVTGCGCTACTGTTGCCTCTTGGCCCGGAGTGTCTC 
CAGTCAAGTGTCTTC CAAGG AT ATAGGTGTCATCACAC C CT ATC GG AAGC AGGTGGAAA 
AAATAAAAATC CTTCTGCGAAATGTGGATTTGACTGAC ATAAAGG TTGGC TC GG T AGAG 
GAGTTCCAGGGAC.AAGAGTACCTGGTCATCGTCATCTCCACTGTGCGGTCAAATGAAGA 

TAGATTTGAAGATGACCGTTATTTTTTGC^TTTCOT 

TTGCAATCACAAGACCCAAAGCACTGCTGATCATTCTGGGAAACCCTCATGTGCTTGTC 

AGAGATCCCTGTTTTGGAGC^ 

GTGTGATCTGCCTCC^AACTC 

AAAAGGTAAAGCACCGTGGAGGAAAGAGTGTGGCTCCACGTGTTCACCTTAAGCAGGCT 

GTGGCTAGACAGCTGTGCCAGGAC C TGTGGAC ATGGTGG AGTC TGC T AC AAC AGGG AGC 
CATTGAGCCTCACCCTATGGGCCATTAGTCCAGCCATGCTTCAGTCTTCTGTGACTCCT 
GCGGCTTCCTGGTCTC^GACTGAATGTTGGTATGOVTGGGACCACTGAGTCAGCTGGG 
CTGCTCCTGCTTCCTTGGACTGACCTTGGTTCCTAACAGTTAGTTTCTGCCTGTGGGCA 
ATCACTGCCAC TAC ACTCCC CCAAATAAACACTTC CAT A^.CC C 

Figure 14a

SEQ ID NO.: 14 Spg15 encoded protein sequence

MI DDL I YF SND AVT S TVL LNVGQ EV IAVVZENKV SNGL2CAI RVEAVS DK^TEDD S KNS S K 

(aSDSSPRVLIGCVTSmiEGAGra 

SEAISVK?LRYKRVT)KVCISSLCGI^G^^ 

ES SQ SCY IWRALCMT PVKRDATLGEAPQEPYGALLLKNKGD IEVTRMT3F GTLXEGE SK 
S IVTWI HNKG KF S RZL VS CRLANWDKAHQ FRFETQGRS 7. S C PGAAAG SV? HGENVNS LN 
HHREDKTDE I ?E SRLANS TS I S PDGCACX2E SREKGNT PEKQ E? EPGGL I ? PGEXTHIV 
VTCSAKNPGRCKELLLLCFSDFLIGRHLE^ 
AKTTVWTTQKRNSBRQLPSFLPQYP^ 

KFSTLLWLEE IHAE IELK^Y^SRWLKRKGDLL VLEVPGLAE SRP SL YAGDKLI LKSQ 
EYNGHV~EYIGYVMEIHESDVTL^^ 

IHLGVKVLF P EE I ILQSPQVTGNWSLAQDTKNDGQS ITNITRNDGQSMTKVTRNDSQS I 
TNI IRNDGQ S I CTvTRNDGQ P ITKVTRNNSQ S ITNI TRNDGQ ? ITKNKKTVKDQTKKTT 
Figure 14b

EEI^GTTDQPEKASSTA^T^E 
YILFG? PGTGKTVT I IEAVLQVHYALPDSR JLVCAP SNSAADL VCLRLHES KVP KP AAM
VRVNATCRFEETIIDAIKPYCRDGHDIWK^SRPRIIITTCSSAGLr/QIGVP.VGYFTW 
FVDEAGCASEPECLIPLGLISDINGQIVIA^^ 
SRPAYLRDENAJXSACGAYNELLVT^ 
SLLGWEKLPRKGFPLIFHGWGNEAJREGRSPSW 
KDIGVITPYRKQVEKIKILLRNTO^ 

RYFLGFIjSNSXHFNVAI TRPKALL 1 1 LGNPHVLVKDPCFGALLEY SVSNGVYTGCDL PP 
ELQALQK 

Figure 15a 

SEQ ID NO.: 15 Spg16 cDNA sequence

cctggctatgcggctagtatccggaggacagacgggggcctcttcctgctcgctgatgt 
ctctcataaggtcattcggaacgactctgtgctggatgccATGC^TGCTATCTACaAGC 
AGAACAAGGAGCACTTCCAGGACGAGTGC^GCAAGCTTCTGGTTGGCAGCATTGTCATC 
ACGCGCTACAACAATCGTACCTACCGAATCGATGATGTGGACTGGAACAAGACCCCTAA 
AGACAGCTTTGTCATGTCGGACGGGAAGGAAATCACATTCCTGGAATACTACAGCAAAA 
ACTATGGGATCACAGTCAAGGAAGATGACCAGCCGCTGCTGATCCACCGGCCCAGTGAG 
AGACAGAATAACCATGGCATGTTGCTGAAGGGCGAGATCCTGCTGCTGCCCGAGCTCTC 
CTTCATGACGGGGATCCCTGAGAAGATGAAGAAGGACTTCAGGGCCATGAJVGGACTTGA 
CTCAGCAGATTAACCTGAGCCCCAAGCAGCACC^CGGTGCTTTGGAATGCCTGCTGCAG 
AGAATTTCACAAAACGAGACAGCCAGCAATGAGCTGACCCGCTGGGGGCTCAGTCTGCA 
TAAAGA.TGTCCAC^AGATTGAAGGTCGGCTTCTGCCAATGGAGAGGATCIAACTTAAGGA 
ACACTTCATTTGTC^CATCGGAGGGCCTGAACTGGGTTAAGGAAGTGACCAGAGATGCT 
TCCATTCTAACTATTCCC\TGCATTTCTGGGCACTCTTTTATCCAAAGAGAGGAATGGA 
CCAAGCCAGAGAACTGGTTAACATGTTGGAAAAGATTGCCGGGCCCATTGGCATGCGCA 
CAAGCC CC CC AGC CTGGGTTGAGC TG AAGGATGACCG AATAGAGAC C TATATCAGGAC C

ATTCAGTCCTTACTGGGAGTTGAGGGGAJIGATAC 

CACACGTGATGATCTCTATGGAGCCATCAAGAAGCTGTGCTGCGTGCAGTCCCCAGTGC 
CCTCAC^GGTCATCAATGTCCGAACCLATTGGTC^^ 

CAGAAAATTTTACTTCLAGATGAACTGTAJ^ACTGGGTGGTGAGCTCTGGGGAGTGGATAT 
TCCGCTGAAACAACTAATGGTGATTGGAATGGATGTGTACCATGACCCCAGCAGAGGCA 
TGCGCTCTGTGGTCGGCTTCGTGGCCAGCATAAATCTCA^ 

AGGGTGGTGTTCCAGATGCCACATCAGGAGATTGTGGACAGCCTGAAGCTCTGCCTGGT 

GGGTTCCTTGAAAAAGTATTATGAGGTGAACCATTGTCTCCCAGAGAAAATTC 

ACCGAGATGGAGTGTCTGATGGCCAGCTAAAGACAGTTGCCAACTACGAGATCCCTCAG 

CTGCAGAAGTGTTTTGAAGCCTTTGATAACTACCACCCCAAGATGGTGGTGTTTGTAGT 

TC^GAAGAAAATC\GCACC AATCTGT AC CTTGCTGCTCCTGATCACTTCGTAACC CCCT 

CCCCCGGGACTGTGGTTGATCATACCATAACCAGCTGTGAGTGGGTGGATTTCTACCTT 

CTTGCCCATGATGTGCGACAGGGCTGTGGCATACCTACACACTACATCTGTGTTCTGAA 

CACTG£AJ\ATCTGAGCCCTGATC*^^ 

ACTGGAATTGGCCTGGTACCATCCGAGTTCC\GCTCCTTGCAAGTATGCCC\CAAGCTA 
GCTTTCCTGTCCGGACAGATTTTGCATCATGAGCCA^ 

GTTCTTCCTGTAActgggaacttggacaccggctgcaaggagcaaccggactcagctca 
gctcctccctacagaatcaacagaaatggcagtggaatuttiatgtctgcatcttcTictt 
tctccatctttgragaattagactcccgttcttctcttaaccctgatatcacagtaggg 
tgttgtggtgcatgcctcttatcccagcacctgggagactgaagcaggagtatctttag 
ttcaaggccagcctagactacatggngagtcctaggc^agccaagattacagagtgaga 
ccctgtctcaaaaaacaaaaacaaaccactgtcccctcaaagcccacaacaaaaccaaa
gcctggggtcaaggaagcaagtttgtaggtagccgctggctgcccgctgccttcatgga 
gtgtgtgccgtcagcgctgcttctcctcagccgagcgtggagcttcggacagggcagtg 
atgacatgttcttagcatgtcaaatccccctaccaaataagtcaaacaaggaaaaaata 
gcccccaaggcagcctgagcatcagttcctagaggttatgcctacagaaccatcctatt; 
tctggtggcagaagtgacatgaagtcatcgcagacatcttaaaggagagctactgtgca 
gctgtctacatgtgtgaaagacatgtagaaaaaccagcgtagggtacagtcgggcgtat 
gtgcccacctgacccagggctgtgagtctgacttcccgagagtctggctagagctgctt 
ttctggtccttttggttatttgcaaccatcatcagattgctttcctgcagcccgactga 
tacacgcatgtgcgcacgcacacacttttgttctttgtgactaatctugaataaattca 
attagaacacatggaaaggattcagcagacctaggaacatttggggtggagtgtagttt 
ttctgcaaaagtctgtaaatgagattacgcaagagtctcttccagctgtgggctggtgt 
tgcttggaaaatttcaaaatcccaaagtttcaggcttcccaaagtfcggcctggaaaaat 
gtgatagtctcacctgagtctagacatgtaggaaattttcctagggcctctgggcttca 
gtatttgggggaagcactggttttctgtgttatttttttccttctgtttcagaatcttc 
aagtttcttcaggctucagtgggccatccctttactgggctctaaaagctaattttact 
taaccttttcaaatgtgtatgtatcgatttatgttttggtgtggtggatggatcgtagg 
ggactgagcagaaatagtcattttaaataacagtgtgctaggagagcctcagtgtgaag 
tcctgaggagcagcgggggctgtgggagtcagtgtcagccctcactcagacaggccaag 
cctgggctcgaagacaacattgtccagggaagcctttagtttgctatagcacccagagg 
ttggcgaggaataagggttagggtcctcaaatacacccatggcttttggagtctatgac 
caaggccaggctgagacctgaaatgtaaaagccatagaattaaacagaacagactgaaa 
aacgagtcttaagttccactttctggatttctctggaagtcttctgaatttgtctgtag 
aagttgctgttcctagcactccttttctctttaggtagaacagatacctgaccataatg 
ccagaatgtactttttctgccttggtttttttatgccttgtttttcagtttcagggcca 
aacaattggqccctgtggtgtaa aataaa aacaatgtatgtgta taaaaa

Figure 16

SEQ ID NO.: 16 Spg16 encoded protein sequence

MHAIYQQNKEKJQDECSKLLVG^^ 
LEYYSKNYGITVKEDDQPLLIHRPSERQNI^ 

RAMKDLTQQ DJL S PKQHHGALECLLQRI SQNETASNELTRWGL S LHKDVHKI 2GRLL PM 

ERINLRNTSFVTSEGLNWVKEVTRDASILTIPMHFW 

GPIGMRTSPPAWVELKDDRIETYIRTIQSL^ 

CVQSPVPSQVI1WRTIGQPTRLRSVAQKILLQMNCKLGGELWGTO 

HDPSRGMRSWGFVASINLTLTKWYSRVVFQMPHQEIVDSLKLC 

PEKIVVYRDGVSDGQLKTVANYEIPQLQKCFEAFD^ 

DHFVTP S ? GTWDHTI TS CEWVD FYLLAEHVRQGC G I PTHY I CVL*NTANL S FDHMQ RLT 
FKLCHMYWONWFGTIR . 

Figure 17a 

SEQ ID NO.: 17 Spg17 cDNA sequence

ggaaggATGTCGGAAGCTGAGGCAAGCAGTGGGATGGCCCACLAACGCTGGACCTGATGA 
GAAGACATTGCAGGTGTTGCGGGACATGGCCAACCGCCTGCGAATCCGTTCCATCAGGG 
CCACAAATTC CTCGAC CACTAGCTATCTCATACCATGCAGC a -ATGCTGAGATCATGTCT 
GTGCTGTTCTTTTATACAATGAGATATAAACAA 

CCGGTGCATCCTTTCAAJ^GG^ACTACCATTTGTTAATGTGGCAACAGGATGGCCTGGAC 
AAGGACTAGGGGCTGCCTGTGGGATGGC^.TATACTGGCAAGTACTTTGACCAAGCCAGC 
TACCGGGTGTTCTGTCTTCTGGGGGATGAGGAATCCACAGAAGGCTCTGTTTGGGAGGC 
ATTTGCC TTTGC ATCC TAC TAC AATTTGGACAATC TT ATGGC AATCTTTGATGTGAACC 
GCATTGGACACAGTAGCTCCATGTCTGTTGAGCACTGCATTGCCATCTACCAGAAGCGT
TGTGAAGCCTTTGGGTGGAATACTTACGTGGTGGATGGCCGTGATGTCAAGACCCTGTG 
CCATGTATTCTCACAGGCAGCTCAAGTGAGAGGCAAGCCCACTGCTGTGGTTGCCAAJ^ 
CCTTC-.AGGCCCGAGGC ATGCCAAATG TTGAGGATG C GGAAAGTTGGTATGGAAGGC CA 

ATGCCAAAAGAAAGAGCAGATGCAJlTTGTCAJiGTTAJ^TTGAGAGCCAGATACAGACCAA 
G&AAATTCTTGTACCATCACCCCCTATTGAGGACTCGC^ 

TATGTATGACTTCGC CC CCTGTTTACGTAGCTGA.TGACAAGGTGTCTACTCAAAGAGCA 

TGTGGTTTGGCTCTAGCTAAACTGGGCCATGAAAATGA-CAGAGTTA 

TGACACCAAGAACTGCAATTTTTCTGACATATTCAAGAAAGAACACCCAGAGAGATTCA 

TCCAGTGTTGTATTGCTGAACAAAACATGGTAAATGTGGCTC TAGGATGCTCCACC CGT 

GACCGGACCATTGTTTTTGCTTACTCCTTCGCTGCCTTTTTCACCCGAGCATTTGATCA 

GATCCGATTGGGAGCCATCTCTCAAATCAACATCAACCTTATTGGTTGTCACTGCGGGG 

TATCTACTGGTGATGACAACCCTTACCATATGGCCCTGGAAGACCTGGCCATGTTCCCJA 

GCAATTCCCAATTGTGTTGTOTTCTATCCAA 

TTACTTAGCAGCCAATACTAAGGAAA^^ 

CAATTATTTACACCACTCAAGAAACCTTTC^GATTGGACAGGCCAAGGTTGTCCGCCAT 
AGTGACAATGACAAGGTCATAGTTATTGGAGCAGGAGTCACCCTGCATGAAGC C CTAGT 
AGCTGCTGCTGAGCTCTCTAAAGAAGATATA.TCTATCCGTGTCATTGACTTGTTTACCA 
TTAAACC TCTGGAC ATTGC CACCATCATTTC CAATGC AAAAGC CAC AGGTGGC C GAATT 
ATCACCGTGGAGGATCACTATCCTGAAGGTGGC^TTGGAGGAGCTGTCTGTGCAGCAGT 
CTCCATGGAGCCTAACATTGTGGTTCATAACCTGGCAGTAATGGATGTGCCTCGTAGTG 
GAAGATGCAATGAAGCACTAGATTTTTCTGGAATCAGTTCCAGACACATCATTGTGGCT 
GTGAAATGCATCTTAATGACCTAAattagctgttgtcrtcggtctgagaagactagcct 
cttctctgtatgtttatgcttgtgccaaaaccatcaattagatagaaatgatgattgcg 
ctttgttctccttaaaggcaaaaccagttaacacctttgttccacatticagaaaiitcta 
attcccctctaatctgtgactacacatacaaatatcactgttttaaccatcaaaggggc 
tataagatttataatggcagaatagataacaagacaactacctgacactgaactttctc 
- a taagac t gaa ta taagtgga 1 1 aaggaac aa tg t agaagtgacag t c't ggaagta c 1 1 
tcttcataagtaaagcaaatgtgtgttgaatcgtagtccgcagcccacctttcacagaa 
ctagacccccgagatacatagtgctgttcaacttrgagttaccatgtacactgcctgcag 
gttcctgagttatgtgcaatttcagatgcncccgatttttctctgctgtttccctttat 
atcccctccacagttcacacgttgatgaacagaacaaaagcagttatgatcacttgtgc 
tgttgtgatgactgtuctgtgaagacgtaaggacactcgatctccacactatgctgttt: 
gatgt taataaagca t z taa 

Figure 18 

SEQ ID NO.: 18 Spg17 encoded protein sequence

MSEAEASSGMAHNAGFDEKTLQVLRDMAl^^ 
FFYTMRYKQEDPEHPDNDRCILSKGLPFVWA^ 

VFCLLGDEESTEG SVWEAFAFASYYKLDNLMAIFDVNRI GHS S SMSVEHC IAI YQKRCE 
AFGWNTYWDGRDVKTLCHVFSQ 
KERADArtfKLIESQIQTNKILVPSPPIEDSPQINI^ 
LAIiAKLGHSNDRVIVLGSDTKNCNTSDIFKKZKPER^ 

TIVF AYSFAAFFTRAFDQ IRLGAI SQ INI3nLIGCHCGVSTGDDN?YHMALZDLAIy[?HAI 
PNCWFYPSDAVS TEHAVTLAAOTKSMCF IRTSQAETAI I YTTQ ETFQ IGQAKVVRKSD 
NDKVIVIGAGVTLHEAL VAAAEL SKED I S IRVIDLFTIKPLD IATI 1 SNAKATGGRI IT 

vedhyfegg i gga.vcaavsme pni vvenlavmdvprs grcnealdf sgissrhi ivavk 
cilmt. 
Figure 19

SEQ ID NO.: 19 Spg18 cDNA sequence

ggcacgaggcgcgaagccctcacgctcgtgccgcggctctgaggcggcaggtcgtggcc 

gaatctccgcttgcgagtggggccagaggctcgtgtctccagagaatgATGGGlAACCA 

CCTTGTAAAACCCGACTCTAGAAACTGCAAGAGGGC.\AGAGAATTGGAGCCTCAGGTGT 

CTGATAGTCCACAGGTATCTTCTCTTGGAAAATCAGAGTCATCTCTCTCTGAGGCTTCT 

GGACTCTTTTATAAAGAGGAAGCTCTGGAGAAGGATTTGAGCGATATGAGCAAGGAAAT 

TAATCTGATGTTGTCTACATATGCAAAGATTTTAAGTGAAAGAGCAGCAGTAGATGCAT 

CTTACATCGATGAGATAGATGGACTCTTC\AAGAAGCCLAATATTATTGAAAACTTTCTA 

GTACAAAAAAGAGAGTTCCTGAAGCAGAGGTTTACAGTAATTACCAACACCCTACACAA 

GTAAtgtgcctatgccagctaaaagtccttcctgttactgctgtgttcttcttgctaga 

gaaaacatacattaaactgtaactccttctaaatttaaaagaagttttaagatagatat 

taatatgaagtgtgtaatatttcttttgagggtcaaacatttggcacattatataaaaa 

tataaattaaaaattatatgcatgttfcttttctttattgtttattcctaaattgcttag 

cccttcttaaactatgaaagaggactctgttaatttgafctatgcttaacaatatttgtt 

taaatagcagatgatttttgagatagttttaaagcgtttttcttgatttttattataa 

Figure 20

SEQ ID NO.: 20 Spg18 encoded protein sequence

MMANHLVKPDSRNCKRARELEPQVSDSPQVSSLGKSESSLSEASGLFYXEEALEKDLSD 

MSKEINI^STYAKILSERAAVDASYIDEIDGLFKSANIIEOTLVQKREFLKQRFTVTT 
NTLHK 

Figure 21a 

SEQ ID NO.: 21 Spg25 cDNA sequence

GCAAGAGTCAAGAGAGTGTTGGTGTGTTCAGGCCAGAGTCTCTGTTCTATACAGCAGCA 
AATCCTAGGACTTGATTGCTCTCCATTC^CTGTTTCTACCTGTGCTGAGTGCCTGTCCT 
AAGATTCTGATAGAGGTTGTTTTCCCTTGGCATATACACACACTGCTGAGTTGGGAAGC 
TGCTTCATTTGCATTTAGCATCCAGTGCCAGTCTCTGGTAAACTTGGGAAACAAATAGA 
AACTCCGAGATTGTTAATCAGTATACGTGAGTTTACTTGCAAAGAAAAAAGAAGTATGG 

AGCCC^TATTGATAAATGCTCAAGTCCAGATGTGGAGTGGAAAGGCAGGAATGTCCAAG

TC^AGAAATGCACTCATTGAAACATGTGTAGGAAAACGAGAAGTTAAACTTATTCTCTA 

TTTCAGCACTGGAAAGATTAAGACTTTGCAACTACACGATAATATTAAAAGTGTGGTCC 

TTCAAACCTATGGCGAAGACCAGAATTACCTACATTTGACTTTTAAAAATAATGATTTC 

TTGTTTGTTGAGAAACTCACCACCACAGATGCCAGAAGACTGAAGAGATTTCTAGACAA 

AACCTCTCAAGGTAGTATTCGGCCAGCCAGAAGTGATGAGAGATGTGGTGAGCCTAGCA 

CAAGTGCACAGGAGTTGAATGGCTCTGGAAGTTOVTGTGAAAC2AATAGTGAGTGCTTT 

GAATCACCCAAAGAAAGTGAAATGTGCATGTTTCGTGAGTTGTCTTTGCTTCCATCCTC 

ATCAACCTTTCTTCACAATGTAGGATTATTAGAAAACCAATTCATAAAGAGGAAAAGAT 

TTOTCTCTGATTTAGCAAAAAATGAAAA^CAGAGCAACCTGAAGGAC^.GTATC\GGGAC 

TTTGAGGCAAATTTAGTGGTGTGTATCTCTAATGAAAAGGGAAAAGAAAGGAATGTAAG 

AGAAGTAGACATCAGTAAGCCAGGGTOTGGATTTCCATTTGAGACCAACTATCCTGAAG 

ATAGTGGTGTGGATGTTCGTGA7CTTAATGATCTCATTACAAAATTATTTTCTCCAGTT 

CTGTTGGAAACACACTGTATTGAGAACGGCCTAGAGTGGCa.TGAATATATGAAGACATA 

CTTGCTTTACCCAGAGAAATTGTGGCAAGGCCTGCCTAATGTGGGAAACACCTGCTATA 

TAAATGTTGTATTACAGTCTCTATGCTCAATACCACTGTTTATTAATGATTTATTCAAC 

C\GGGTTTCCCATGGATTAAAGCTCCCAAAGATGATTTTAACATGCTCTTGATGCAACT 

GCTTGTTTTGAAAGATATTTAC\ACGC\AGATTTAGACAGAAGTTACTTATAGGTATTA 

CAAAAGCCCTCCCCATATTTGGAGAGATATTTGCTGTTGACAGGCAGAATGATGCTCAT 

GAGTTTTTAAGTCTCTGTTTAGTTC\GTTGAAGGAGACTTTCCAAAGAGTAACCATGAT 

GTGGC^GTCTGAAAATGATTCGGGGGATTTTTACTTACTTAAAGACATTTTTGCTGATT 
Figure 21b
ATGCTACTATCAACVJIATGCCCGTT^^ 

CTAAGCTCCA.TTTTTTGTAAAGCTTGTGGCCTGACTCTTTTTAAGGGAG 
ATACCTTTCTATCAACATTCCCCAA_GGAGGGAJ^GACATGTC CATCCAGTC CACTTTAG 
ATCTTTTCTTTAGTGCAGAGGAGCTTGAGCATAGGTGTGAAAAGTGTTTG 
TCTGTTTCATTTCACAGGTTTGGCCGGCTACCCAGGGTAATTATTGTTCATCTGAJLACG 
OTATCA.CTTTAATGAGTCATGGGTAATGAAGAAGG ATG AGCGGC C CATCC TTGTTTC C A 
AATACTTAAGGCTGTCTTGTCACTGTA.GCAAAAGCACAAAJ^CCGCCCCCACCCCTTCGC 
CCAGGTGAACATGTTAAGAATCTTGACTTATTAAJ^CCCCTTGAAGTGTTGGGTTCCGA 
AATAC TCAAATTGCCTTTTAATTCAGTGAGG AC CTC TAGATC CAAGGGTTTC GAAAC TA 
TAAACATCACATCAAJVCAGGGAGTCAGAAGCACAA^ 

TTGAGTGG AAAAGTGC AGC A.GGAJ^JVATTCAGGGAAAGG TGACACAGCACAT ATAG TTGG 
GTCAGAACTTA.CAAAGGAGACTGAGAJLACTCAAGAAACATGAGGAAGAGCATAGACCCA 
GTGATTTAGATTC TGGTAGTATCAGGG AGGCC CAAAA.GTAC CAACAGGCTG AGAPLATGT 
AACGAAGGGAGAAGTCATAAGCAGATTTCCCTAG 

ACCAATCTCC CAGGAACAGACAGAAAACCTTGGGAAJVACTACACTGTCACATAC CCAGG 

ATAGTAGTCAG AGTTCACAGAGCTCATCAGATTC C AGTAAGAGCTC C C G ATGCAGTG A? 

GATCTCGATAAGA^GGCAAAGCCTACACGCAAGGTGGATCCAACAAAGTTTAATAAA^ 

AGAAGATAATGTTT AC AGGCTTGTT AATATTATCVAC C ATATTGGGAACAGTCC CAATG 

GAGGCCACTACATCAATGATGCCTTTGACTTCAAGAGGCAGAGTTGGTTCACTTATAGT 

GATCTACA.TGTAACAAGAACCCAAGA^GACTTTGTATATAGGGGTCGGAGTTCTACTGG 

GT ATGTC TTC TTTT ACATGCATAATGATATATTTGAAGAGCTC TTGGC AAAGGAAAC TC 

AGTCTACCAGCACATCCAAGGGTTAGTAAGAGGAGTGTATGCTTCACA.GTACATGTCTA 

TCCAAATGCCTCACTTATCTAAATTGGATAGAGAAGGACAGATAATTAGCCCAGGACCA 

ACAGCTCAACGAACATTTATAGAGAAACGTTTGTTCCA^CCCTGACCACAT^^ 

TAATTACAGCTCATGCTAATGAGTCTTCTCTTATACAAGTTTGGAACTGTAACTT^ 

ATAGTATTTTGTCATAAATTTTATTATTGACAGTAAGGTTTTTCA.GTATTTTGGTGGAJ\ 

GTTATATAATCCAAAGTTGCCT^ATACCGATTAAAATGACTTAOT 

AAGGTATGTTOTCTCTCATTTCTGCTTTCTTTCTTGCATGGCAGCATAATAAGTTCTAA_ 

TGAAAAAGCTTTTATATATttGAAGGAJU^-CCACACATC 

GACGAGTCTATCCAGCGCTGTTACCCCAGCCTGCCGCTCTCTTGCTCTTTCTACATTTG 
GAATGGTTCGGTGTCATCCTCCC CTC CTGGCTTTCTC ACAGTGGA.TGCCAATTAATGGT 
GATGCTTAAGTGAAGAGAGAJIGA.GA^CATTTTATGGCTGCACATGTT 
AAAATGGGGAGAGGGCTGCATT^AGCTTTTGTTGATTTTTAGGAGTTTTCATTAACA 
GAACAA^GTTCTTACTGTTCTCTGATTTAGGCCTTGGTTCT 

AGTTTGATTTTGTTAAGTCTGTTTTCCAGTGTCTGTCTTTAGTTGTCTTGTCTTTAATG 
GGGTTATTTTCTTATTAOTAAAJ^^ 

CTCTGCATGTGCTCGGGACCTACTGTGTGCTCCTTTCTCCAGAATTCTCTGTTTGGGAT 
GGTTGAC ACC GT 

Figure 22a 

SEQ ID NO.: 22 Spg25 encoded protein sequence

MEPILISIAQVQMWSAKAGMSKSR^^ 
VliQTYGHDQNYLhXTFKKtTOFLFVHKLTTTDAi^ 

STSAQSIjNGSGS SCETNSECFES PK25ZMC4FRELSLLP SSSTFLHWGIiLENQFIKHJC 
EFF SDLAKNEXQ SNLXDS IRDFSANL WC I SNEKGKSRNVREVDI SKPGFGFPFETNY? 
EDSGVDVx^LNDLI TKLFS PV^LETEC IENGLEWEEYMKTYLLY? EKLWQGLPNVGNTC 
YEWVLQSLCSIPLrllTOL^ 

ITKAL ? I FGE IF AVDRQND AKEFLS LCLVQ LKETFQRVTMMWQ SEM3 SGDFYLLKD Ir A 
DYATINK^VCPVTNNFEFELLS SIFCKACGLTLFKGE? SRYLS 33*1 PQGGKDMS IQST 
LDLx F S AZZLEHRCEXCL YOTCSVS r HRF GRL ? RVT IVHLKRYHPNE SWVMKKDERP ILV
SKYLHLSCHCSXSTK?PPPLRPGEKVXNLDLLK?LEVLGSEILKLPFNSVRTSRSKGFE 
TINITSNRESEAQSGKKVSEVLSGKVQQENSGKGOTAHIVGSELT 

PSDLDSGSIREAQKYQQAEKCNEGRSDKQISLZALTQSRPKPISQEQTENLGKTTLSHT 

QDSSQSSQSSSDSSKSSRCSDDLDKXAKPTRKVD^ 

NGGHYINDAFDFKRQSWFTYSDLHVTRTQ 

TQSTSTSKG 

Figure 23 

SEQ ID NO.: 23 Spg27 cDNA sequence

TTCCCTCAGGCGGTGCGTAAAAAAATATTTCTTGG^ 

AGGAG^GCTTCCCTGAGTGCATTGGTATTTCCCAATAAGATATCAACTGAGCATCAATC 

TTTG^TGTTTGTGAAGAGGCTCCTAGCTGTTTCAGTATCTTGCATCACCTA.TTTGA 

GAATATTTCCAGAACGTGCTTATGGGACAAGATATCTGGATGA.TCTCTGTGTCAjy^ATT 

CTGAAAGAAGATAAAAATTGTCCAGGTTCTTCACAGCTAGTGAAGTGGATGCTTGGATG 

C TATGATGCTTT AC AGAAG AAATATCTAAGGATGATCATTCTAGC TG TATAC AC CAAT C 

CAGGAGATCCTCAGACAATTTCAGAATGTTACCAGTTTAAATTCAAG^ 

SEQ ID NO.: 24 Spg27 encoded protein sequence Figure 24 

MATMQLQRTASLSALVFPNTQSTEHQSLMFVK^ 

DDLCVXILKEDKNCPGSSQLVKWMLGCYDAL^ 

KFK 

Figure 25a 

SEQ ID NO.: 25 Spg33 cDNA sequence

ttgaccctttataaggcct-gttggtcctccctggtgttcagtgcttagcgaggagggc 
ctggctctggagtccattagctggcacctgggcgctcagtcagggagctccccatatat 

-tggagcaagt-gtgaa-getgaagasgt^ c t cttcaga 

gagcctctgtggttctgttgacctgacctcctcagtccafccgagcaagrctgaatctgt 

gtgtccgcctagctgcagacagacttcttgaccATGTGTCCCCCAGTCAGCGTTCGCCA 

TGGGGCCAGGGGCATGTCCTGCCTCTATGAGGCATGGCTGTACCATCTTGTCCATGGGG 

AACAGACGAAGATCTGCTTTGCTTGCTTCAAGGCAGCTTTCCTATTGAATAAACTCTAC 

CTGGAGATGGGAGACTGGCAAGAGGAGGAAGAGGAAGAGGAAGAGGAAGATGCTGATCT 

CCTGGAATACTTGTCAGAGTCAGAGTCAGAGTCTGAGCAGGAGCCAGGGCCTGAGCAGG 

ATGCATGGCGGGGATTGGGGTCCCTTTATGTGCCACAGAGTGTCTCTGAAGGGTCTGGG 

GTCCTGCTGCCAACCCCTGTGTGGACACAGGGCATACTATTCTCCArTTTT^ 

TGAGCTCTTTCCTCAGGAAGCTGTACCCCTGGATCTGGGTCCTGAGGATGCTGAGTGGA 

CCCAGGCCCTTCCCTGGAGACTTGATGGGCTTTTTCCCTGCTCGCACCAGCTCATCCCT 

CCTCTGACTTGGTGGGATATTTTTGATGTGATGCC^ 

GGAGTTGAGATGCCACTGGCCCTTGGACC\GAC^GTAGCACAATCCTGGTTGCAAGACC 
AGAAGTTTGTCCTCCTGTTGGATAGCGTCCAATCTAGGTGCCACCTGCTGTCAATGCGT 
GTCCGCTGGGTCGTAAGGACTCAGGTCCAGCACTGGCAGGTGTTGCTGGACCCTGGTGA 
GATGTGGGTGGCCC^TTTTCGGAAAGAAGTTGGGCAGCACGGCCTGTACCATCAGAGCC 
TGAATCCCTGGAGGCTGAGCATCCTGACAGC?TCAGAATTAGGGATGGAGTTATTGCCT 
GCC^CCTGCTACCTGTGGAATAAAGGCTTCTGGGTAGGTTCCTTCTTGCCCTGGCACAT 
TAACATGCCAGAGACCTGGAGCTGGGAGCCAGGAGAGAGGCTGTTTATCACAGATGCTA 
CTATTTGTGGTACTGACTACCACCTTGCTCAGTCTTTCCTTGA.TTCCCA.CCCCACCCCC 
CACCCCCTCCTGACCCTTACTCCCTGAtgcacattcctgagacacnaaagctcagacat 
tccccagggccccggggac-gcgaagagcaagaggttgcctgttctgagcagctcaggg
gaacagttggcccaggacttgagggggcauctcgaacatcctgtgagcttatgaacctc 
agagggaagcctggcatgttcgtgucagtgttcagtgtttggtaggtgaggcctagctg 
tatgtttagctguatggagtgctgugcggtgggtgttatgggggctccggtcacagatc 
tacgtatgtatggactctgaggcactagttgaccttactgtcataggggccatacgctt 
actgcctcagggtcaagacacccgattictagggttcactgttttitgttigtrttactttg 
ttcattactcgccctccttubgacTcqcttttgtt gaatiaaa gttiggcgtttt aaaaa 

SEQ ID NO.: 26 Spg33 encoded protein sequence Figure 26 
MCPPVSVRSGAJIGMSCLYEAWLYHL^ 

EEEEEDADLLEYLSESES2SEQEPGPEQDAWRGLGSLYVPQSVSEGSGVLLPTPVWTQG 
ILFSIFVTTELFPQEAVPLDLGPEDAEWTQAL?WRIJ)GLFPCSHQLIPPLTWWDIFDVM 
PSPGQPVLLELRCKWPLDQWAQSWLQDQKFVLLLDSVQSR^ 
WQVLLDPGEMWAHFRKEVGQHGLYH^ 

VGSFLPWHINMPETWSWEPGEKLFITDATICGTDYHLAQSFLDSHPTPKPLLTLTP : 

Figure 27 

SEQ ID NO.: 27 Spg34 cDNA sequence

ggcacgagctgcagcctagctgttgagctgctctagcgatcatctgttctgaggcactt 
tgggactgtgggactgtgttctttgctcctgccctgcccctgtgagcgggtggggaatc 
gctgagcctgtggttgtacagctcaagtgcctgccATGGCCGAAGCGCCCTCTCGAATG 
C\GCAGAACTATGACTGGCAGTGCGAGGATGCTATCAAC\CCCACATCCAGCTGTGCCT 
CTATGCCTCCTACGAGTACATGTCGATGGCAGTCTACTTTGACCGTGATGACGTGGCCC 
AGGAGAACTTC\AGCGTTTCTTCTTGACCAAGTCACAC^CTGCCAGACCAGTGCAGAG 
ATGTTCATGCACCTGCAGAATAAGCGTGGAGGCTGCATCTCCCTTCAGGACATCGCGAG 
ACCAGAACGTGACAGCTGGO^CGGGGGATTTCAAGCCATGGAATGTGCCTTCCACATGG 
AGATGCTGATC AAC CAGAGC C TGCTC AAC A.TGC AC GAAGTGGC CAAGGAG AAAGGC GAC 
CCCCACCTCTGCCATTTCCTGGAGCAAAACTGCCTAGATCAGCAGGTCGACATTTTGAA 

GGAGATGAGCGGCTACCTGACCAACCTGCGCCAGATGGGGGCCGTA.GAGCACAACTTGG

CTGAGTACCTCTTTGACAAGCTCAGCCTGTCCTAAagcttcaagtggactgaactggga 
tgtctccactgtcgtggggtctttctctggtcattacacctaatgttcatgttggtttt 
gaagcaaagttcactcattttcggcttctgatggactgttgctttaaataaatttcgtg 
t gt t tg t t t gcagc aaaa t t gaaaaaaaaa 

Figure 28 

SEQ ID NO.: 28 Spg34 encoded protein 

MAEAP S RMQQNYDWQ C EDA INTH I Q LCL YAS YEYMSMAVYFDRBD VAQENF KRF ? L TKS 
HNCQTSAEMFMHLQNKRGGCISLQDIARPERDSWHGGFQAJffi 

EVAiC^GDPKLCHFLEQNCLDQQVDILKEMSGYLTNLRQMGAVEENLAEYLFDKLSLS 

Figure 29a 

SEQ ID NO.: 29 Spg39 cDNA sequence 

gtgcggtCTitgtgtticctgtcactcctgtcgcccrttgguCtcaggactgctcacctca 
cagggccagccaagcccctagagcactcagccatcA.TGAATTGCGAGGATGTCACCACC 
GGGTTCCGCCATGC C AGGGTGTTAJVTGTTCATCAACGAACAAATGGCCAAGCACTC C AG 
AGGC CCGG AGTTCT AC C TCG AG AAC CTGAC C CTGTCC TGGGAGG AGGTGG AGGAJ^-AAGC 
TCAATGTCCTCCTGGACGGTACCGAGGTGCCTCGGGATGTTCAGGAAGCCTGTGCCTGG 
AGCAGCCTGGCCCTC-GGGGTTCGCTTCGCTTTCAGGCAGGGCCAGTTGCAGGGGCGCAG 
AGTGCLAGTGGCTGCACGACTTCGCCAGCCTGCACAGGTCAGCGGCGCATGCCCTGGCAT 
TGGACCTGAAGAAGCTCACCGACCAGC^CGAGATAGAACGCAAGGAGGCGGCCTACCAG 
CTTCTTTTGGCCCACACTAAACTCGCAGAGGTGCAGAGAGAGCGAGACCTGATGAGACT

GAAGCTACTACACGCAAGATTTGCCACCCATATGAGA.GTTGTGCGAGACTACAGAGGAT 

GC CAGC AC C AGGTAC AAG C AAGAATGTCTAC AATTATACAGC C TC TTAAC AGAAATCGT 

CCCTAGtccccagagggcagaaccagagacagatctgaacaggaaacttctgccaactg 

ctccaagtcctcaggtagaaggaaggcgaaggactgtatctgatccggactgagacaca 

actggaagagccctatctcccagagactgtgaacctggagaatacgaagctgttgtggc 

ccatgggacacctgtagcatcagaaatgtgacttcgggttgtctgttattggtgaggat 

acagctgcctccaggattgcaggccagatccctgtcctgcaattttttgaacactcttt 

gggcttgcttacctccccctactgcagggtttcctaacctcattgcacttgacatctgg 

afctcggatag^tcttfcgctgtggggtaggaagtcctgtgtactgt&ggcagtgcggcaa 

aaccagagttgggaacaactaagttggaaattgaaggccttagcagtgtggacaaaaac 

tcaaatagccaaggtcacagaggctgaagctggcaaaagaaaaaaacaaaaaccacagt 

tctttaagartg^tgtttacttfcggatcttcaagaagctagggtggtggcagagtagaa 

accaaacggtcaggactgaggggaacctctaccctctacaacagtagtttcaaccttta 

atgtttcatctcatgtgg-ggtgacccccaaccataaaattaccttcactgactggtac 

ttcatatcatactatgatgaattataatgtaaatacctgtgtctcccaaaggtcttagg 

tgacccctgzgaaaggcttgtttgacctccataggggtcacagcccacaggctgagaac 

tgcagctttacaagaactctaggctgcaaagggaaggagtacaggaagcgccatggatc 

ctacagtgttagtttgtcagctagccaacatgcttggcaaactagattggafcaaaatct 

cgaacatggcgtttaaactctgataaaggcagtaagagttaaggt taggtactttttga 

gcaatttcctcaaagataaaacattgcaacctgcaggaaagctcatgaagactcagtag 

tagaaacgagc cat taaaggagaa t gc 1 1 taaa t gaaagaaacaagaaaaaaga tggt c 

ttatatagtggatzgggaggactagccaatagtattagaaggaaaacaacacgatgaaag 

ggaccattgttttgtgtaagataaaaacacttaaagttcttaggtatgtgcatgtatgc 

tctatgtaaatatgttacaaatttatgtctataaaatctgtattttcagtaagcattga 

aaagatatggaaaagatatttagtgaacaggcagggaacagaaggcagtgtggtatggg 

gggtatagaacagaatacagccaagtt^ - ~ 

gttttgaaagtaataaggtgtttacaaa aataaa agttauttacccacttq aaaaa 

SEQ ID NO.: 30 Spg39 encoded protein sequence Figure 30

MNCEDVTTGFREARVLM^ 

DVQEACAWSSLALGVRFAFRQGQLQGRRVQ^^ 

ERKEAAYQLIiLAHTKLAEVQRERDLMRLK^ 

IQPIiNHNRP . 

Figure 31a 

SEQ ID NO.: 31 Spg46 cDNA sequence 

gggctggagggctgagggtggagcgcccggcATGTGGGGCCAGCGGCTCTTTGCTGGGA 

CGGCTGTGGCGCANAGTGTAAGTTTTC^ 

TACAATAAAGTTGAAGATGTTGTTGGAAGTC^^ 

CC^GAATGTCAGTAAAAATAAGGATATTATGAAGATTGGTTGTTCACTCT 

GTC C TCTTGC T AATTCAGTTTTTGGCAATCTTG ATC C TAAGAAG ATTT ATGGTGG ATTG 

TTTTCTGAAGATAAGTGCTGGTACA^ 

GTGC CTGGTGAGGTACATTGACT ATGG A^ATACTGAAATTC T AAAC C GATC TGATAT AG 
TAG AAATTC CTCC GGAGCTACAATTTTCT AGTATTGC CAAGAAGTATAGA.C TTTGGGG A 
CTACAGATTCCTTCTGGCCAAGAAGTTACCCAGTTTGATCAGGGTAGAACATTTTTGGG 
GAGTTTGATTTTTGAAAAAGAA^TTAAAATGAGAATAA 
CAGTTATTGCTCAGGCTGAGTATGGCACTGTCGA^^ 

GGATTTGCAGAAAAGTGC\GACTGACCTC^GGCATTGATGCCTGTGAGGCAAAGAJ^ACC 
TGATCCTAATCAGCTTGCTCTCAGGAGTCTCAAGAACCCT^TCCCCCTGTGCGGGCGCA
GATCAAACCAGTCAACCTTCAGCAGGCCAJ^^ 

GATGTGAAGTATGAGACCAGTGCAGGCAATCACGTGACATTTCCAJ^GGAAAijTTTG^ 
TGCTGGTGACTTTAATTTAGGGTCTAATGTCAGCTTC^ 

AACTTATTG A AG AG AATG AAA^GCTTAAAAG AGAGAAAGAGG TTCTTC T AGAAAATT AC 
AAAGC ATTAGP A.TTAAAAGTTGAGC AGACTGC C C AGGAGC TGCAGG AAGAGAAAAC AGC 
TACCATGGATCTGACTAAGCATTTAGAAAGCACTCTGAAGACGTGTGTAGGAA.CCAGGC 
TGAAGAJ^TTTGGCAGCTAAAGTAGAACTATTGAAAGAAATTAGGCATATTAA 
ATTCGCTTTGG£ AJ^TGACCTTTC AG ATGCTATGCAAGTGTTGGA.TGAAGGGTC CTTTAC 
TACTCTAGCATCTTTGAAT^^^ 

AGAAGATCCAAACTTGTCTTAATGAGAATGAAGGTAATATTTTGATTGCTGAAAGAAAT 
GAACrTAC AACAGAAG C TGTTC GTGGCTGTAG ATGTTTTTATTCTGGAAGTAGATGACTT 
ACCACTTGATAA^CGCTTAAAAACATTGCAGGACTTAGCAACTTCTO 
ATGGAAAGGCCAAAGAAGGAACTAATAACTCTGAAGAAACACTTAGAAAGTTTTTTGAC 
TGGCAGTGTAC CAAAAG AGAAGAGTTC GCCAG TATTAGG AGTGAAACAGAGGC ATC TCT 
GCAGCACCTTGTGGCATGGTTCCAGAGCAGCCAGAAGGTTTTTGATCTGTCCTTGGATG 
AACCATTGACTTCAGAAGACCTGATTGGTAATATTGACGAAATTCTAGAGAAGACTGAG 
TCATGTGTC TGC^JVAGAGCT AGAGC TGTC TC TC ATTGAGC AAGGTGTCAT AGAC^A-GG A 
GATTATTTT AATT AC AT ACAG TC AAGTGC TGC AAAAG ATC C ATTC TGAGGAAAAGTTC A 
TTGCCACCTTGCTGTCCAAGTATAAGGACAGTGTTGAGTTTAAAAAGCAGATGATTGAC 
TGTTT AAAT AAGAAC CCC AATGTGGATTAC TTGC TTTC T ATT AAGAAG ACATTG AAAGG 
CTTAAAAGCACAACTGAGATGGAAATTGGTTGAGAAGAGTAATTTGG^AGAATCTGATG 
AC CATG ATGG AAC TG AAATTG AGAAAAT AAAGC AAG AAAT AAC T C AATTGC GAAAT AGT 
GTTTTCCAGGAA^TTTATCATGAGAGGGAGGAATATGAGAAGCTGAATAGCTTGACCCA 
GAAATGGTTCCCTGAGCTGCCTCTGCTGTATCCTGAAATAGGATTGCTTAAATATATGA 
ATTCTGGTGGTCTTCTTACTATGAGCTTAGAGCGGGACCTTCTTGACA.CTGAGCCCATG 

AAGGAACTTAGCAGGAJ\GGGTCGTCTGGTGTGCTCGGAGGTTAATGGC<:AGCCAGTTC-T - 

CTTAAAGGGCTATTCCGTGGATGTTGACACAGAAGGCAGGGTGATTCAGAGAGCAGCCT 
CTTACCATAGAGCTTGTGGATATGCTAAAGAAGAGTCTGGGTTACTGCCATTAATATTC 
TTGTTTTTGTGTAAGTCTGATCCTGTTGCCTATCTGATGGTCCCATATTATCCTAAGGC 
AAACCTGAGTGCAGTTCAAGCCAGTATGCCTTTAACTTCAGAAGA^GCTTTAJLAA.GTCA 
TGAAAGGTGTTGCCCGAGGACTGCATACATTGCATAGTGCTAACATAATTCATGGATCA 
CTTC\TCAGAJ\C^TGTATTTGCCTTAAATCGTGAACAAGGGATTGTTGGA.GA.TTATGA 
CTTCACCAAATCTGAGAGCCAGCGAGCTTCAGTCAACGCGATGGTTGGTGGATTGAGTT 
TGCTC TC AC C TGAATTGAAAACTGG AAAACC TC C TTCTGC AAGTTCAGAC TT ATATGCT 
TATGGTTGCCTTTTCTTATGGCTTTCTGTTCAAAATCAAGAGTTTGAGACAJ\ATGAAGA 
TGGAA^TCCAAAAGTAGATC^GTTTCATTTGGATGATAATGTCAAGTCCCTCCTTTGTA 
GC TTGATA.TATTTTAGAAGTTC AATGACTGCTG AGCAGGTTTTG AATGC TGAATGTTTC 
TTGCTTCCAA^.GGGGAAATCAGTTCCAATCCCAGAAAAAGAGATTGAJITGTACTCAGCA 
TAGCAG AG AAG AT GAATC AAAGATGGAG AG TC TGGATAG AT AT AGTGAAAAGAC AAGAA 
ATGGTGAAGCCAACCCTTGAccaacaaaccaaccctittattgttgcEgratiatgrccct 
ttcaaaaac-ctgccccgtutigctzagtagacaaaaacgtuccggaactiagtrggattg 
catctttcgatttgggctgcaaaaaacaaaagaaatgctttgaccacacctiaaaaa 

SEQ ID NO.: 32 Spg46 encoded protein sequence Figure 32a

mwgqrlfagtavajcsvsfpglvqi^ 
kigcslsevcplai*svfgnldpkkiyc^^ 

TE ILNRS D TVE IPPELQFSS I AIC^YRL WGLQ I ? S GQ EVTQFD 
QGRTFLGSL IFEKZ I KMRIXATY QDGTVIAQ AZYGTVD IGE£VAKXGF AEKCRLTSGID

ACEAKXPDPNQLAJLRSLKNPIPLWGRRSNQSTFS 

FPKZSLAAGDFNLGSOTSLAXIKQDQKLIIENEX^ 

LQQEKTASMDLTXHLESTLXTC/GTRLXNI^ 

LDEGSFTTIJ^SLNELEKIWAEYW^^ 

ILEVDDLPLDKRLXTLQDLATSLES^^ 

SETEASLQHLVAWFQSSQKVFDLSLDEPLTSEDLIGNIDEILEKTESCVCKELELSLIE 
QGVIDKEIILITYSQVLQKIHSEEXFIATLLSKYXDSVEF^ 

IKKTLKGLKAQLRWKLVEKSNLEESDDHDGTEIEKIKQEITQLRNSVFQEIYHEREEYE 

KLNSLTQKWFPELPLL YPE IGLLKYMNSGGLLTMSLERDLLDTEPMKELS SKRPLVCSE 

VNGQ PVLLKGYSVD VDTEGRVI QRAAS YHRACGYAXEESGLL PL IFLFLCKSD P VAYLM 

VP YYPKANL S AVQ ASMPLT SEEALXVMKGVARGLHTLHSANI IHG SLHQNNVT ALNREQ 

GIVGDYDFTKSESQRASWAMVGGLSLLSPELKTGKPPSASSDLYAYGCLFLWLSVQ^Q 

EFETNEBGIPKVDQFHLDDNVKSLLCSLIYFR 

EIECTQHSREDESKMESLDRYSEKTRNGEANP .

SEQ ID NO.: 33 Spg58 cDNA sequence Figure 33a

caaagtctgATGGAJVTCTGAAAAAACAAAGATGGAGAGTGAAAGTTTGTGGATGATCTC 

TGATTCTGAGAGTTATTCAGTGGACTCACACACA.GAA^AAGGTA.GAGCATCAGTA.TCTA 

AAATAAATCTGATACAAATTGATGAAACAGAAAAACCAAGAACTAAGAGATACTTGATG 

GAGTCTGATTCTGAATCAA.GTAATACAGACTCAGATTCAGAAGGATGTGAGCTAGCCTC 

AGCAGCTGTGAAATACTTCATAGCTACAAAGACATTTCAGCAGAGCAGTGCTAGCAGCC 

AGTTCCCAAAGGATTCATGGTCTGCAAGTAGAACAATAAACTCAGACTCTGAAAGCCCT 

GTGATGTCCTCTGATTCTATGAAATACATGAAGAAAGCTGAAACATGCAAGAGTACCTG 

TAATTTGGAAAGACTCAAGGCGGCTCACAAGTCTGAGTCTCTACAGGACTGGTTAGATG 

CTAAAAGAAAACAATTAGA.TTCTGATAATGCTGGATACTGGGATAGCTCTGGAJ^-ATAT 

CAGTTTAGTTCTATAGTACCTCAAGAGAGCGTTGGAAAACGTCAGGGTGACCTTCAAAC 

GTTTCAGCATAGCACTGAAAAAAAGGA^^

AATTTGGAJ\ATGAAJ^GAGATCA 

GAGACGGGATTAGACAATGAGGGGTTTCAJ^.TGGATGAGGAJ^GAGA^GGATGTCTTGT 

GGAATCTGACTTTCGTGATTCAGAACGTGAGGCACACCTGCTAGAAGCCCACCGGCTTG 

GTGCTCGTAAAAAGGAGAATCGTCCTCCAGGGTTTTGGAGACCCATTATCCTGCCTCCT 

AAGCTTGCCCAGGATAAGAAJ^CTGAAGAACAAA^^ 

AGTTTCTAGAATTCAGCTAACAAGATACA^^ 

AGGCATCTTCAAGTCTCAGTGACAAGCCACT 

AGCTATAGCTTTTCTCCAGACTCTCCCACATTCACAGATGAAAAGCATCACAGAAAAGC 

TAGCCTGAAAATATCTGGCTATAAGCGCCAGTGCAAGGAATATAGATATCCCCATAGTT 

GTGAAAGTTTGAAGTATCAAATCAGTCCCATTCCCCTAAGTTCTGAGACTTGCACCTCT 

AATGTATCATCATTTGTTGACAGCCCAACTTCTAAAAGTCCTAAGTCTGTCAaAAGA^ 

GAAGTCTCGAAGAAGTATTACATATTCTCCAGAAC^^ 

GTTTTATGGAAJVTCAGTAACTCATCT^^ 

GACTCAGACTCTCCATTACATGGCCAGATTTCCTCACA'TTCTAAATATTCTCTGCGCTC 
TAAAACTATCAGACACTTCAAGACTTCTCGAAATCGCCCTTTATCTCAATCCCTGGATC 
CTCAGCATTCTGTGGTCAGCCGCTGTTCTCTGCACAGGGAAGATTCTAAACATTCTATA 
GATTCCACCTCTTATTTACATTGTGAAAGTTGTGCAT^ 

TTCTGTTACACACACCATTTCTAA.GACCACTAAGAAAATCATGGGCCAACATAGTACCC
ATGGCCACATATCAGGACCTGTAA.GATTGTCTCAAAGTGAAAGCAAGTTCAACCTTACT 
GCTCAACCTCAAAJ^TGAGGATACTCCTGATGTCAGTGATA.GAAATATCAAAGCTGA.GGC 
TAATGTTGAAGATAAACTTCTTTACAAAGATGA 
CTGAAG^TGAAACGGATGGTGAAGATGAW-CAGACACTGAAGATGAAGATGAAGATGAT
ACC^AAGATAAAAAJ^GATCCTAAAGACAAATCTGACCCTGATGGCAGTGATCCCAAAGA 
TGGCAACTCTGAAAATAATACTGATAGCAACAATGGGTCTCAACCTAGTGGTTCTTCTG 
GACCTACA.GGTGGACCTGATTCCAGCAATGATGGTGACTCTAA^AATGTAACTGATCAC 
AAGAGTGAATCTGACCCTACCATTGATAACGCTACCAACAGTGATGTTAACTTGAAATA 
TAGCAC^GATGAGACATGTACCAACAATTTAGACAATGCTTCAGATCTGGCAGAATATT 
TCAA^CATCACAATAATGCTGACTTCAAGGGTCGCACAAACCCAGCCTCTGGAAACAAA 
ACTAGAACCATACTGGACTATATTTCTGGTTCCAACAATGAGGACACTGGCCCTAGAAA 
TACGATGATAAAAGAAAATATTGCTTATTCTGAGAATATTAGATTGCTTTCCAACAGTT 
ATCAAAATAATGTCATTAAAAATGGGAGTGAACCAAGCAGCAACCCAAGCCCCCAAAAC 
AGCTATGGGCTCCCAAAAGACCTTGACTCTAACTCTAATATCAATCCCAGTAATGCTAC 




A(jACA(iU r^-riU 1 Lrti iu.-.uni.ii ^u..^— 

GTAAGGTCAAGTTTTGTAGTCAACTCAAACTATTTTGACAGAAAGAAATATGCTGGTAG 
ACTCAGCTTTGCACTTCACACTATCAATGCCATTGATACAAATAATGTTATCACCTGTA 
CTAGTGCTGTTAGGTCTCAGTTTGCTTCTGAAAAAACCTCTGTCCTA.GACACTAAACAT 
TCCCCTAGATTTAGTCGTTTCAGGAGTTTCAATGTTATCATCAC-CCCAAATTATAATAC 




TTTATTGCTGGAACAAACTATCCTGATTTTTTGATAACCTCGGAATTTCATGAACCCCT 
TAAGCTTTGTAGAGCTTATAAAATTTTTGATAACCAAAATATTGATGCTCCATTCCAAG 
ACTCTGCTGGATACATGGACTCTGATAATTCTACATATGCCACTGGGTCCATGGTTGCC 
— rr,^,m^^^» » a rnnrpnnr r m«VTn'^ z a a^A'PT'TT'CCTAGGATCCAGAATACAATTGG 




CATCAAGG?TCCTTCTTCCCCTTT(JAAU^'i-Vii TfiuwAn.<uu»"ii ^ --' . 

GTTTCGATGTTATAGTGGAAGCTGAACTGCCAGATATTATGAAGTTTACTATATCATCA 

GGTGCTGTGAATCAATTACTTCAGCTCAGACTCCAAACAGGTAGCAGACAAAATGTTGA. 

CCTTTGAttaatgaaaagaataccttgggatggaaaagacaaaagcacaaccaagaagg 

ttctgaggagatgagcaaataacttcaaaagaactacagtgttccctgaacaatgttta 

ttttatgtttattcttagatattcaccccawtatatgcccatgtttctaetgtctattg 

tggccacaaatacttcaaagtaccatgtgacaatcgccccttccatatgatttacatgt 

ggcaaagtatggaattaggaaaaggacacactgtagtccttttctagacagcatccaaa 

aataaattttactactatgtattcataatacagatgagcccttcaagcaaattcttcct 

tgtattattctctgcattttgaagaagagggccttaactfctaaaaaaatttaagacaga 

taaaattttttctagtattgtggtgacaattctagattcaaaagaacccatactaaatc 

tataatttattgttaatcttaattgttgttactgttttgtgagcttggagctcgaaatt 

aaaatagttaagactcataaaaaa 

Figure 34a 

SEQ ID NO.: 34 Spg58 encoded protein sequence

MESEKTKMESESLWMI SDSESYSVDSETEKGRASVSKINL IQ IDETEXPRTKRYLMESD 
SESSNTDSDSEGCELASAAV1CY?IATKTFQQSSASSQ??KDSWSASRTINSDSES?VMS 

SDSMKYMKK-AETCKSTCNLERIjLAA^^ 

SIVPQESVGKRQGDLQTFQHSTSKKEVGSSSDKHQAQFGNERDQSDPKSKRYLIKTETG 

LDNEGFQMDEEREGCLVESDFKDSEREAKLLEAKRLGAiUCKElirRP PGFWRP I IL P - 

QDKKTEEQKSVQQKDNRVSRIQLTRYKSEDKNVRFEEASSSLSDKPLSQEKLKKKHSYS 

FSPDSPTFTDEKHERKASLKISGYKRQCKEYRYPKSCSSLKYQISPIPL5SETCTSWS 

SFVDS PTSKSPKSVTRKKSRRS ITYSPEQGSQKCTRCFME I SNS SYKKCL iNSDDSDSD 
SPLKGQISSHSKYSLRSKTISHFKTSRNP^LSQSLDPQHSWSRCSIiiREDSKhSiDST 

SYI^CSSCASLQNLKGSSVTHTISKTTKKIMGQHSTHGEISGPVRLSQSESKFNLTAQP 
QNEDTPDVSDRNIKAEA^/ZDKIL^

XKDPKDKSDPTCSDPKDGNSZNNTDS^ 

SDPTIDNATNSDVNLXYSTDETCTN^ 

ILDYISGSNNEDTG PRNTMIXZNIAY SENIRLL SNSYQNNVTXNGS E? S SNP S PQNS YG 
LPKDLD SNSN IN? SNATNNTVNPNYGAXS T S TAIYXKT AALNYY SD I ND VTGFTYEVRS 
SFWNSNYFDRKKYAGRLSF AI»HT INAIDTNNVTTCT SAVRSQ F AS EK? SVLDTKHS PR 
FSRFRSFNVT I S PNYNTKNSQNANKSSISNIYNIiPATELEINILSVLKI IYGNTPNF IA 
GTNYPDFLITSEFYEPLKLCHAYKIFDNQNIDAPFQDSAGYMDSDNSTYATGSMVALDA 
KESGFLKYFPRIQNTXGIKDPSSPFKVFSNQNILVTSFDVIVEAELPDIKECcTISSGAV' 
NQLFQLRLQTGSRQNVDL . 

Figure 35 

SEQ ID NO.: 35 Spg59 cDNA sequence

ttttttttttctttgctttcgattttgttttctttaaatgtaccttaagagaaaaggag 
aagagaatgccagaaaactaagatggttcaattaatccacttccttcagccttatgccc 
ttggataggacatcctccatacaccggccATGGTCCCTCGGAAAGCCCACAATTTCTCC 
TGCTGCTTC CTTAAATATTTCA.GAGCCC CTAGAGTATGCCTCC^ACCCCTGC TACCC CA 
TCTCCC^CAAACCACCTTCCCAGGCCCCAGGGTGGGGAC^GGCACCAGAGTGGGCACCT 
CAGGAGATTTTCCGAGGTTGTCTTCAGTGCCCGCTACATTGGAAAACTGCCTGTTTCCT 
AGTGGC^CCAGGAGGGTTGCCTCGACTCCACGTGCATGCTATCCTTTCTGTTTATTTCG 
GTTATTCCAAGACCCAAAGACCTTCACACCCACCCACCCTCCGCCGCCTGCCGTCATGA 
AGGTTATAGAGTTGAGACCAGCAAGTCTTGGATGTGAGGGCTTTAATTTATCTACCTCA 
ATTATCTTTATTTTTGTAGCCAAAAGTCTTCTCTATTTTGCTATTTTTGCGACCACCCA 
AGTATTGCCAGGTCTTAAACCTAAGAGTTCTTATACTGGTAAGAAGGCCCCGAAGCTCA 
AGAAGTCTTCGTGGTTGGTTCTGTGGGTTTTGTTTTTGTTTTTAATTACAT'I'TTTGTTC 
GTGTAAtaattctgaacagctzgttgtaacagtgttccaatggtttagtgaggtaggaga 

gatttctccagtctctgggtag^ 

gaagatggagactctcgoccagtgtttcctctgacttgttgcacctgattttaggatcc 
aaaacttgttttgttgtcacttctttctttttctctttctttttttcgccaagtcgtgc 
ctttccttctgggattgtaagtgaaccccgtgatcaagtgttcaccggttcacctgttt 
acactgtgaagtgggcatcctacggagaacacaccagccttggtcccctcacgccccgt 
gaatgaatgtgtacgccacagttttacctgctgatactgtaaggtccgaaaataaagtg 

aatgctttccctccacg 

Figure 36 

SEQ ID NO.: 36 Spg59 encoded protein sequence 

MVFRXAHOT SC CFLKYFRAPRVC^ 

PATLENCLFPSGTRRVASTPRACYPFQliFPJiFQDPKTFTPTHPPPPAVMKVIELRPASL 
GCEGFNLST S I IF I FV AKS LL YF AIF ATTQ VL PGLKP KS S YTGKXAPKLKKS S WL VL WV 
LFLFLITFLFV 



Figure 37a

SEQ ID NO.: 37 Spg64 cDNA sequence

GGC ACGAGACTATT TCTTCGTACAGGAGAAG ATTCCCG AACTGCGC C GGCG AGGC CTGC 
CGGTGGCCCGCGTGGCAGACGCCATCCCCTACTGCTCGGCCGACTGGGCGCTCTTGAGG 
GAGGATGAGAAGGAGAAATACTCAGAAATGGCTCGAGAGTGGA.GAGCAGCCCAGGGAAA 
GGATTCTGGGCCTTCAGAGAAGCAGAAACTTGTA'IKZTACACCACTGAGGAGGCCAGGCA 

TGCTTGTACCAAJ\ACCAAGTA?TTC?CCCCCTGATATGT 

GATCAAGCTCTCCTTGGAGGCATTTTTTATTTTCTGAACATTTTTAGCCATGGTGAGCT 
Figure 37b

ACCTCCTCATTGTGAACAGCGCTTCCTCCCTTGTGAAATTGGCTGTGTTAAATACTCCC 

TC C AGGAAGGTA.TTA.TGGC AGATTTCCAC AGTTTT ATC CA.TCC A-GGTGAAATTC CAC GA 

GGATTTCG ATTCCATTGC C AGGCTGCAAGTGATTCTAGTCAC AAG ATTC CTATTTCAAA 

CTTTGAATTCGGGCATGACC^GCAACTGTGTTACAAAACCTCTATAAATTTATACATC 

CAAACCCAGGGAACTGGCCACCTATTTACTGCAAGTCTGATGATAGAGCCAGAGTCAAC 

TGGTGTTTGAAGCGTATGGAGCGGGCATCAGAAATAAGGCAAGATCTAG 

TGTAGAGGACCTTGTAGTTGGGATCTACCAGCAAAAATTCCTCAAGGAGCCCTCTAAGA 

CCTGGGTGCGAAGCCTCCTAGATGTGGCCATGTGGGACTATTCTAGCAACACGAGGTGC 

AAATGGCATGAAGAAAATGATATTCTCTTCTGTGCTTTAGCTGTTTGCAAGAAAATCGC 

GTACTGCATCAGTAATTCTCTAGCCACTCTGTTTGGAATCCAGCTCACTGGAGCTCATG 

TACCACTACAAGACTATGAGGCCAGCAACAGTGTGACACCCAAAATGGTTGTATTGGAT 

GCAGGGCGGTACC AGAAGC TAAGAGTTGAGAGTC CAGGATTCTGTCATTTCAAC XCTTA 

CAATCAGGAACAAAGATCAAATACATCTACTGGTTATTATCCATCTGGGGTGAi^AATOT 

CGGGCCCTCACAGCAGTGTTCGCGGAAGAGGAATTACCCGCTTACTAGAGAGCATCTCA 

AACTCCTCCAACAACATCCATAGATTCTCCAGCTGTGAGACTTCACTCTCA.CCTTACAC 

GCCCCAAAAAGATGGGTACAAACCTTTCTCCTCCTTTTCTTAATGATGGTACTTTGTGC 

GATTTCTGGAAAAATAACAAGCCAACTTCCTT^ 

CACATCAATAGTAAATGTCA.CTCCTAAAACCTACTTAATTTGTAAGGAAACTATTTCAT 
AGATTAAAAGTAATTGTGGTTGGAGAAG 

Figure 38 

SEQ ID NO.: 38 Spg64 encoded protein sequence

MAREWRAAQGKDSG P S EKQKL VSTPLRRPGMLVPKPS I SF PDMSNLS IKSDQALLGG IF 
YFLNIFSHGELPPHCEQRFLPCEIGCVKYSLQE 

SDS SKECI P I SNFEFGHDQ ATVL QNLYKF IKPNPGNWP P I YCKSDDRARVNWCLKRMERA 

SEIRQDLELLTVEDLVVGIYQQKFLKEPSKTW^ 

FCAIAVCKKIAYC I SNSLATLFGIQL 

ESPGFCEFNSYNQEQRSNTSTGYYPSGVKISGPHSSVRGRGITR^ 
SSCETSLSPYTPQKDGYKPFSSFS 

Figure 39a 

SEQ ID NO.: 39 Spg65 cDNA sequence

ggcacgagccaaagagtggcccaaaacctcagctcccatagaggstaccctatcccaac 
cggagaaactcttatttgtcatcgacaacttggaagtgATGGAATGTGATATGTCTGAA 
CGGGAGTCGGAGC TGTGTGATACCTGCACGGAGAAGCAGCCATTGCGTATC CTGC TGAG 
CAGTTTGCTCAGGAGGAAGATGCTCCCCAAATCCTCTTTCCTCATCTCTGCTACCCCAG 
AGACTTTTGAGAAAAT GGAGGGCAGGGTTGAGTGC AC AAATG TGAAAATAGT AAC AGG A 
TTCAATG AG AGC AAT ATT AAGATGT ATTTC CGCAGC TTGTTC C AAGATAAGAC CAAAAC 
ACAGGAAATCTTCAGTTTGGTGAAAGAAAACCAGCAGCTGTTCACTGTATGTC^ 
CTGTGCTCTGCTGGATGGTGGCCACTTGTCTAAAAAAAGAGATAGAGAAGGGAAGAGAC 
CTGGTCTCTGTCTGCCGACGTACCACTTCCCTGTATACCACTCAOATCTTCAATTTGTT 

C ATTC CC CAAAGTGCC CAATATCCAAGTAAGGAAAGCCAAGCTCAGCT 
GTTCTCTGGCCGCTGAGGGTATGTGGACTGACACATTTGTGTTTGGTGAGGAGGCTCTC 
AGAAG AAATGGGATCATGG ACTC C GAC ATC C C CAC AC TGCTGGAC GTAAGGA.TC CTTGA 
GAAGAGC AAG AAATC TGAAAAATCTT ACATTTTCCTCCACCCGTCT ATC C AGGAGGTC T 
GTGCAGC CATCTTTTATCTGCTAAAGAGC CAC ATGGAC CAC C C T AGCC AGGATGTTAAA 
AGTATAGAGGCACTCATATTTACATT^ 
TGGCTCTTTCATCTTTGGCCTTTTAC^ 
TTGGCCACCAGTTGTCCCAGGAAATAJL^ACGTCAGTTGTATC 

AGTGGCAACGAAGAGCTTCAAGAACAGGTAGATGGCATGAAGCTGTTTTACTGTCTGTT 
Figure 39b

TGAGATGGACGATGAAGCCTTCTTAGCACAAGC\A.TGAACTGTATGGAACAGATTAACT 
TTGTGGCTAAGGATTATTCTGATGTTATTGTTGCTGCCCACTGCTTACAACACTGTTCT 
ACACTGAAGAJ^XTATCCTTGTCAACCCAGAATGTCCTGAGTGAA.GGTCAAGAACACAG 
CTATACGGAAAAGCTACTCATGTGTTGGCATCATATGTGCTCTGTGCTCATAAGCAGTA 
AGGACATCTACATACTCCAAGTGAAAAACACTAATCTCAATGAAAC^GCCTCTTTGGTG 
TTATATAGTCATCTGATGTACCCCAGCTGCACCCTTAAAGCACTTGTGGTAAATAATGT 
GACCTTCCTATGTGATAACCGCCTGTTCTTTGAGTTGATTC^GAACCAGTGTTTGCAGC 
ACTTGGACCTCAACCTCACATTCCTGTCCCATGGTGATGTGAAACTGTTGTGTGATGTC 
TTGAGCCAGGAAGAGTGCAACATAGAAAAGCTGATGGTAGCAGCCTGTAACCTTTCACC 
AGATGACTGCAAGGTCTTTGCCTCCGTTCTGATCAGCAGCAAGATGTTAAAGCATCTTA 
ATTTGTCATCTAACAACTTGGACAAAGGGATATCCTCTCTGTCCAAGGCTTTGTGCCAC 
CCAGACTGCGTTCTGAAGAA.CTTGGTGTTAGTCAA.CTGCTCCCTC\GTGAGCAATGTTG 
GGACTACCTTTCGGAAGTrCTTAGGCGGAACAAAACACTGAACCACCTAGACATCAGCTT 
CCAATGACCTGAAGGATGAAGGGCTGAAGGTTCTCTGTAGGGCTCTGAGTCTCCCAGAC 
AGTGTCCTGAAGTCACTAAGTGTAAGATATTGTCTCATCACCACTAGTGGTTGCCAGGA 
CCTGGCTGAAGTCTTGAGGAAGAACCAGAACCTGAGGAACCTACAGGTTTCAAACAATA 
AAATAGAAGATGCTGGTGTGAAGCTCCTGTGTGATGCTATAAAACATCCCAACTGCCAC 
TTAGAGAATATTGGATTGGAAGCCTGCGC^CTAACTGGTGCCTGCTGTGAGGACCTTGC 
TTCTGCTTTTACCCACTGTAAGACCCTGTGGGGAATCAACCTGCAGGAGAACGCCTTGG 
ACCACAGTGGATTGGTTGTACTGTTTGAGGCTCTGAAAC\GCAACAGTGTA.CCCTGCAT 
GTACTTGGACTTCGAATTACTGACTTTGATAAGGAAACCCAGGAGCTCCTGATGGCTGA 
GGAAGAGAAAAACCCACACTTGAGCATCCTAAGCAGTGTGTAAggcagaagcagaaaac 
aaaggtggatgttctgctgcaagaaacatggctgtgttctgacctcaaactacctccaa 
aagaaagagagcaggatccttaatttggcccattatatacaaaaattacaggtcactaa 
cattccaatgagatacatacagtttctttacctccccccccattcagatgtgttttj;ca 
aga taga t gt gac 1 1 ;fct t gt t tgcactacagattcaaacaggccattcaaagacagtta 
tggtaaaatgtctgccatataatgacagtttttcacacacttgtattctaagcatacaa 
taaagtcacttttaaagataaaagtatctttagaaatcccttaagaagagatttgcctg 
ttggtggattactgggcctataatgtcggtccaggcaatgatggtgccccacaaagctc 
tttagagaaatggacaaaggttggggaaatgatgatggaaactgttgctgttttgtgtt 
ttttattaaat aataaa tttaggcattttct aaaaa 

Figure 40 

SEQ ID NO.: 40 Spg65 encoded protein sequence

MECDMSEI^SELCDTCTEKQPLRILLSSLLRRKKLPKSSFLISATPETFEKMEGRVECT 

NVKIVTGFNESNIKMYFRSLFQDKTO^ 

EIEKGRDLVSVCRRTTSLYTTHIF 

VFGEEALRRNGIMDSD IPTLLDVRILEKSKKSEKSY IFLHPS IQEVO^AIFYLLKSHMD 

HPSQDVKSIEALIFTFLKKVKVQV\TTFFGSFI?GLLHESEQKKLEAFFGHQLSQEIKIIQL 

YQCLETI SGNEELQEQVDGMKLFYCLFEMDDEAFLAQ AMNCMEQ I3STFVAKDYSDVXVAA* 

HCLQHCSTLKXLSLSTQNVLSEGQEHSYT^^ 

NETASLVLYSHLMYPSCTLXALWNW 

VKLLCDVTiSQEECNIEKLMVAACNLSPDDCKVFAS^ 

LSKALCKPDCVTiKNL^ 

RALSLPDSVLKSLS^YCLITTSGCQDLAEVLRKNQNLRN^ 

IKHPNCKLENIGLEAC ALTGAC CEDLAS AFTKCKTL WG INLQ ENALDKSGL WLFEALK 
QQQCTLHVLGLRITDFDKETQELLMAEEEKNPKLSILSSV 
Figure 41a

SEQ ID NO.: 41 Spg69 cDNA sequence

tggcagcatcactcaggcaagcgccacgagagcccgcgcttcccgtcaggtCTictgtga 

gtttgaggccagcgggggagacagagggaccggagggctccggggcggctcggagacat 

cttgcctctgtccatctccggaatcccccctgaagatccatcaggATGAGCTGCAAGAC 

TCCACCCACACTCCAGGAACTGGCAGAGAACAGCCTCCTGAAGAACCAGGACTTGGCTA 

TCTCTGCTCTGGATGACATACCCTC^CTTTTCTTCCCATCACTGTTCT AAAGGCCTGC 

AGAAATAGATATGTTGGGATCATAAAGGCGATGGTGC^GGCGTGGCCCTTCCCCTGTCT 

TCCTCTGGGGGCCATGATCAGTAGGAAGACTGCCTACAGGAGAATCTTAGAGATTATCC 

TGTATGGGCTTGATGCCTTGCTTTCCCAGAAAGTTCCCCACA.GCAGGTGCAAGCTGCAA 

GTGCTGGATTTACGGGTTATGCCTTTGAAGCTGTGGAACAGGTTGCCTGTGTTTGGGAC 

TGCTGGCTGCAGTGAGAATCCAGCAGTGGTGGGCCATTCGGGAACAGAGGTGAAACAGC 

CAGTGAAGGTGCTGGTAGA.CCTGGTCCTCAAGGAAAGCCOVCTAGATTCCACAGAGTCC 

TTCCTCGTTCAGTGGGTGGATAJ^CAGGAATGGTTTGGTGAGTTTGTGCTGTTGCAAGCT 

GCAGATCTGGGCTATGTCCATGTATTACCACAGAAAACTTTTGGAGATTTTGGATCTGG 

ACTCTGTCCAGGAGCTGCGTATGTACTGCATCAGTAATCCTGTCTGCCTGCTTAACTTC 

GCCCCTTACTTGGGTCGCATGAGGAACCTGCGCTGCCTCATCCTCTCTCACCTCTGGCA 

GACCTTCTCGATGACCCCGGTGGAGAAGCAGCAGGTTATTACCCAGTTCACGTCTCAGT 

TCCTCAAACTGAAATGCCTCCAGATCCTGCATCTGGATACTGTCTTCTTCCTAGAGGGT 

CATCTGGATGAGCTATTCTGGTGGCTGAAGACACCCTTAGAGACCCTGTCTGTGATTGA 

TTGTAATCTCTCAAAATCAGACTGGTTCC^TATATCTGAGTTCCAGTGCACAA^ 

TAAAACACCTGAATTTGAAATGGGTCAAACTGACCCATTTGAGCCCAGAGCCCCTTCGA 

GTTCTGTTACTAAAATCTGCATCTACCCTAACATCCCTGGATTTGGAGGGCTGTCAAAT 

GATGGACTCTCAACTCAGTGCCATCCTACCTGCTCTGAGATGCTGTACACAGCTCACCA 

AGTTTAATTTCCATGGGAACTATATCTCCATGCCTATCCTGAGGGA.GCTGGCATATAAC 

GTTGTCAAGC^GAAATCCCAAC^GTCAAAGATACGCTTTATCCCAAGCTGTAGTCATCA 

CAGTGGTGTGGAGTTTGAGGCGATTTCTC^GGCTCATATTGTGTTTGTAGA.TGTGGATC

GGACTACTGGGGAGCAAGAACAAGTCTTGTTTTATGCTATTTGTTCTGGAGAATATGTG 
TTATGAtatctctacaatgtactaggacttgttagactcctacctaatgagatacagat 
ttaagctctcttgtggacactgggatgtgtccggtcagagtatgcacaacacacgcttit 
tagactgcagttcttgcgagtgagaagagaaagttgacgctagagtgcaaacaagtgag 
agtatttgacatagggaacgtgggacacattgacagaagtcatgggagtgggttccggt 
tgggcaggtacaaaagggcattcctccagcctggtgcttttttgatttutgtagtaaaa 
aatgggaccacctgtccnacctggacgattcaaatcctgtgactgctaggaacaaactt 
gttttacgctattcgctggttcttctactttgatcttctgagtgctgagtataggctta 
tagcaccagacccggtnccctgtagttcctctaacatgtggctgtacctctcatctttc 
tagtcaaacccagtattttatatagtagcagtcgttgtggggctagagagatggcttat 
cagttgagagcacttgacgcttttgtagagtacctgggtttgattccccacatttacat 
ggtggctcacagtcatccagaactactuatggggtgcatgaggcacgcatgtagtgcat: 
agacatacatgtaggcgaaacactcagacacguaactctaaaacaattinaaaaatrgac 
ttgtaccgatgagactgaacgtctagctgaggcagccagcacccagtacttctacgaca 
gacactgctiaagccatgcatittacacccg-aaaatgctacccatgagcctagcaaattg 
aaccaggaatacatgggugagtagacaataaactctagcaggaaaaaccccaagcztct: 
atgacaazatcaaacaccccagaccactigaaggaugcaatccccatcccgaaccaccta 
aagaggggcatcagguactgagccatcagactgggttgaaatgagctcctgactcugtc
cctaggctacagcacaact:gaaccccactgctgaacctctt:t:aatttttctgtggt:caa 
tgaaccgtgtCwacaaatgaggcrg-ggccaaacgtcaggtguctgtgcatcttigcncc 
catecacctigacaaggct-ticatiggaaccagaagcttgatggTitcacctagaccuagga 
ctgttagccagagggctcctataggtctgccagticcctgtgctggagtcacaggctcat
gcatcuatggctgacccctitctgtcccccataccctccgcttccrgaataatgctgca 

Figure 42 

SEQ ID NO.: 42 Spg69 encoded protein sequence 

MSCKTP FTLQELAENSLLKNQDLAI SALDD I ? SLFF PSLFKKACRNRYVG I IKAHVQ AW 

PFPCLPLGAHISRKTAYRPJILEIILYGIiDALLSQKVPHSRCKLQVL 

PVFGTAGC SENP A WGHSGTEVKQ PVKVL VDL VLKE S PLDS TE S FLVQWVT)NRNGL VSL 

CCCXLQ IWAMSMYYHRKLLEILDLDSVQELRMYC I SNPVCLLNFAPYLGRMRNLRCLIL 

SKLWQTFSMTPVEKQQVITQFTSQFLKLKCLQILHLDTV^ 

LSVIDCMjSKSDWFHISEFQCTSQLXHLjNLKW^ 

EGCQMMDSQLS AIL PALRCCTQLTKFNFHGNYI SMP ILRELAYNWKQKSQQSXIRF I P 
SCSHHSGLEFEAISQPHIVFVDVDRTTGEQEQVLFYAICSGEYVL . 

Figure 43a 

SEQ ID NO.: 43 Spg70 cDNA sequence 

ggcacgagtaggcctgtacagcaaagtttgaacaagcttgaagataataaatcaccatt 
tgaaacaaaggccattgaagtgaagagtgaggttgactgtcccccgggagttactaaag 
aaataacagcgggtgctgagagagtaATGTTCTCTGATTTGAGAAGTCTCCAACTCAAG 
AAAACCATGGAGATAAAGGGTACAGTTACTGAJ^TTCAAGCACCCGAGTAACTTTTATAT 
CCAGTTGTATTCTTCAGAGGTTCTAGAAAACATGAACCAACTCTCTACAAGCTTGAAAG 
AGACATATGCAAATGTGGTGCCTGAAGATGGTTATCTTCCTGTTAAGGGGGAAGTTTGT 
GTTGCCAAATACACAGTTGATCAGACCTGGAACAGAGCCATAGTACAAGCCGTGGATGT 
GCTGCAGAGGAAGGCCCACGTCCTGTACATTGACTATGGGAACGAGGAGATGATCCCGA 
TAGACAGCGTTCACCCGCTGAGCAGAGGCCTTGACTTGTTTCCTCCTTCTGCCATAAAG 
TGCTGTGTGTCAGGCGTCATTCCCACTGCGGGCGAGTGGAGTGAAGGCTGTGTTGCAGC 
TGTCAAGGCCCTTCTGTTTGAGCAGTTCTGCTCTGTCAAGGTCATGGACATCTTAGAGG 

AGGAGGTAGTCACCTGTGCCGTTGAGCTTGTTGTACAGAGCTCAGGAAAGCAGCTGGAC 

CATGTGCTGGTGGAAATGGGGTATGGAGTGAAACCCGGTGAGCAGAGCTCCACGGAGCA 
GAGTGTGGACCACAGTGCATTGGAGGACGTTGGAAGAGTGACAGTTGAGAGCAAGATTG 
TGACAGACAGAAATGCCCTGATCCCC^AAGTGCTGA.CTTTGAATGTGGGTGATGAGTTC 
TGTGGCGTGGTTGCCCACATCCAGACACCAGAGGACTTCTTTTGTCAGCAGCTGCAGAG 
CGGCCACAAGCTTGCGGAGCTTCAGGAATCCCTCAGTGAATACTGTGGCCATGTGATTC 
CACGCTCTGACTTTTATCCAACCATTGGGGACGTGTGCTGTGCTCAGTTCTCAGAGGAT 
GATCAGTGGTACCGCGCCTCGGTTCTGGCCTACGCTTCTGAAGAATCTGTCCTGGTTGG 
ATATGTCGATTATGGGAACTTTGAGATTCTCAGTCTGAAAAGACTTTGTCCCATAATTC 
CAAAGTTGTTGGATTTGCCGATGCAAGCTCTAAATTGTGTGCTGGCAGGCGTGAAGCCA 
TCATTAGGAATTTGGACTCCAGAAGCTGTGTGTGTCATGAAAGAGATGGTACAGAACAG 
GATGGTCACAGTGAGAGTGGTGGGCATGCTGGGGACCAGGGCCCTGGTGGAGCTCATCG 
ACAAGTCGGTGGCTCCTCACGTCAGCGCTTCTAAAGCTCTCATAGACTCGGGCTTTGCC 

ATCAAAGAAAJ^GGACGTGGC^GATAAAGGCAGCAGTA.TGCACACAGCCAG 
GGCCATTGAAGGTCCAGCAGAGGCGTTGGAGTGGACGTGGGTGGAGTTCACTGTTGACG 

agaccgtggatgtggtggtctgcatgatgtacagtcccggggagttctactgcc^cttt 
cttaaagatgatgccttagagaagctcgatgacttgaatc^gtccttagcagactactg 
tgcacaaaagccgcccaatgc<:ttta^ggcagagatagggcggccttgctgtgc 

tttcaggtgacggcaactggtaccgggctctagtcaaggagatcttacc^ 

gttaaagtccactttgtggattacggaaatgttgaagaagttaccacagaccaactcca 

ggcgatattaccacagttcttacta.cttccatttcaggggatgcagtgctggctagtag 

ATATACAGCCCCCAAACA^GCATTGGACAAAAGAGGC^ 
GTTGTGGGGCTCAAACTCCAAGCCAGAGTTGT^ 
GGAGCTCACCGATCTTTCCACTCCTTACCCCAAAATC^TTAGTGATGTGCTCATCAGAG
AGCAGTTGGTCTTAAGGTGTGGTTCACCACAGGACTCACTGATGAGCAGAXCTGCTAAT 
CAAC\TAAGCAGATCGAXAGCCACA.GGGTGCAAGCCAGCCCTTCAACA.GA.GCAGTGGAA 
GACAATGGAATTGCCL\GTTAACAJIGACTATAGCAGCAAATGTA^ 

CAGCCTTGTTCTACGCCATCCCCAGTGAJ^ATGTCAGAAAATCAAGAGAAGCTGTGTGTG 
TTAGCAGCTGAATTGTTAGAJ^ACTGTAAT^ 

ACGGACCGGCGACGCGTGCTGTGCTAJIGTACACAAA^ 

TTGTTCTGGAAACGTCGGAATCTGATGTGAAAGTTCTCTACGCAGATTATGGAAACATC 
GAAACCCTGCCTCTTTCCAGAGTGCAGCCCATCCCAGCCAGCCACCTGGAGCTGCCCTT 
CCAGATCATTAGATGCTCACTAGAGGGGCCGATGGAGCTGAATGGAAGCTGTTCGCAGT 
TAGTGATGGAGCTGCTGAGAAATGCCATGCTGAACCAGAGTGTGGTTCTCTCTGTGAAA 
GCCATTTCAAAGAATGTCCACGCAGTGTCAGTTGAAAAATGTTCTGAGAACGGAATGAT 
CAATATAGCTGAGAATCTGGTGATGTGTGGCCTGGCAGAAAACCTCACTTCTAAAAGGA 
AAAGTGCTTCCACTAAAGAGATACCACACAGCAGAGACTGCTGTTGCACAGAGTTACAG 
AAACAGATTGAGAAACACGAACAGATTCTCCTCTTCCTCTTAAACAATCCAACCAACCA 
AAGTAAATTCACAGAGATGAAAAAGCTGCTGAGAAGCTAAaaacatcatctcttggaaa 
taaacactgggaagaaagagacagcaaacgccagaaaaa 

SEQ ID NO.: 44 Spg70 encoded protein sequence Figure 44 

MFSDLRSLQLKXT1XEIKGTVT 

DGYLPVKGEVCVAKYTVDQTWNRAJVQA ? IDSVHPLSR 

GIJDLFPPSAIKCCVSGVTPTAGEWSEGCVAAVKALLFEQFCSVKVMDIL 

LVLQSSGKQLDHVLVEMGYGVKPGEQSSTEQSVDHSALEDVGRVTVESKIVTDRNALI^ 

KVLTIJWGDEFCGWAHIQTPEDFFCQQLQSGHKIJ^LQESLSEYCGHVIPRSDFYPT^ 

GDVCCAQFSEDDQWYRASVIAYASEESVLVGYVDYGNFEILSLKR^ 

ALNCVLAGVKP SLG IWTPEAVCVMKEMVQN^^ I DKS VAPHVS 

ASKALIDSGFAIKEKDVADKGSSMHTASVPLAIZGPAEALEWTW 

MYSPGEFYCHFLKDDAIjEKLDDI^QSLADYCAQ 

ALVKZILPSGNVKVHFVDYGNVEEVT^ 

TKEATTRFQACWGLKLQARVVEITANGVGVELTDLSTPYPKIISDVLIREQLVLRCGS 
PQDSLMSRPANQHKQ IDSHRVQ AS P S TEQ WKTMEL P VNKT IAANVLE IIS P ALFYAI ? S 
EMSENQEKLCVLAAELLEHCNAQKGQ^ 

VKVLYADYGNIETLPLSRVQPIPASHLELPFQIIRCSLEGPMELNGSCSQLVMELLRNA 

MLNQSVVLSVKAISKSTVHAVSVEKC^ 

HSRDCCCTELQKQIEKKEQILLFLI^^ 

SEQ ID NO.: 45 Spg85 cDNA sequence Figure 45a 

ccactgaagaaagagaaggtgggctcatcatcagcccggaccacttccttccctccaat 

gactggaagagatcctgggtaagtaagcrgcacccctgaggcgagataaaacttcccaa 

agccaaagctgtagatattttgggcagaaatgactccaggtttaagctgtcggttggga 

gaactatttgggcctccctigaatgaggccaccacccagggcttctcctatgaccttctg 

aagaagatagactictcctcagcgactcaccggcagcccaccctggtttggtagccccat 

tcagggaagcccaaatagttctcccaaccgacagctcaaacctggaatcatttccgccc 

aaaacatctacagctttggctctgggaac-tctatcccacctcagggATGCAGCTTACT 

TACCCAGGATCTCTTCCAGTCATTGGAGAGAAGGAAGTGGTCCAGGCTGATGA.TGAGCC 

CACCTTCTCTTTCTTCAGTGGCCCCTACATGGTCATGACTAACCTCGTGTGGAATAGGA 

GCAGAGTCACAGTAAAGGAGCTGAACCTTCCCACCCGTCCCCACTGTAGCAGGCTGA.GG 

TTGGCCGACTTGCTGATTGCTGAGCAGGAGCACAGCAGCAACCTGCGGCATCCTAACCT 
GCTGCAACTGATGGCTGTATGTTTGTCCCGGGACCTGGAGAAAATTCGCCTGGTTTACG
AGCGTATCGCAGTCGGC^CACTGTTCAGTGTCCTCCATGAACGAAGGTCCCAGTTCCCA 
GTGCTGCACATGGAGGTGATTGTGCACCTGTTGCTCCAGGTTGCTGATGCCTTGATATA 
CCTGCATTCCCGGGGGTTCATCC^CCGCTCCCTCAGCTCCTACGCTGTCCACATCGTCT 
CTGCAGGAGAAGCAAGGCTGACTAACCTGGAATACCTGA 

GCACACAGGAACGTGACTCGAATGCCCCTCCCCACCCAGCTGTACAACTGGGCTGCA.ee 
AGAAGTGGTCTTGCAGAAGGCAGCCACGGTGAAGTCAGACATATACAGCTTTTCCGTGA 
TCATAC^GAGATCTTAACAGACAGTATACCCTGGAATGGCTTGGATGGCTCACTTGTT 
A^GAAACCATAGCCTTGGGAAATTATT^ 

CTATGATATTGTTAAGTCAGGAATCCATGCCAAGCAGAAGAACCGAACAATGAACCTTC 
AAGATATTCGTTATATTCTGAAGAATGACTTAAAGGAATTTATTGGAGCTCAGAAAACT 
CAGCC.AACCGAGAGCCCCAGAGGGCAGAGCTATGAACCCCATCCTGATGTTAATATCTG 
CCTAGGTCTAACTTCAGAJITATCAAAAGGACCCTCCAGACTTGGACATCAAGGAACTAA 
AGGAAATGGGTAGTCAGCCCCATTCACCTACAGATCACTCCTTTCTCACTGTAAAACCA 
AC^CTAGCTCCTCAGACCCTAGATTCAAGTCTGTCAGCCCAGAAACCTGACAATGCAAA 
TGTTCCTTCTCCTCCTGCTGCATGTCTGGCAGAAGAGGTCAGGAGCCCCACTGCAAGTC 
AGGACAGCCTCTGCAGCTTTGAAATCAATGAGATCTACTCAGGCTGCTTGACACTGGGA 
ACTGACAAGGAGGAAGAGTGTCTGGGGACTGCTGCTTCACCTGAGGGGGATAGACCAAA 
CCAGGGAGATGAGCTGCCATCCCTGGAAGAAGAGCTCGATAAGATGGAGAGAGAATTGC 
ACTGTTTTTGTGAAGAGGACAAAAGCATTTCAGAAGTTGACACAGACCTTCTTTTTGAG 
GATGATGACTGGCAAAGTGATTCTCTTGGTTCACTCAACCTGCCCGAACCAACCAGAGA 
AGCCAAGGGCAAAA.CGAGCAGCTGGTCCAAGACTGATGAGTATGTCAGTAAGTGTGTGC 
TGAATCTGAAGATTTCACAGGTGATGATGCAGCAGAGCGCTGAGTGGCTGAGGAAGCTT 
GAGCAGGAGGTAGAGGAGCTCGAGTGGGOVCAGAAGGAGCTGGACAGTCAGTGCAGCAG 
TTTGCGGGATGCTTCATTAAAGTTTGCV^TGCCAAGTTCCAGCCGGCTGTAGGCCCTC 
CATCTTTGGCCTATCTTCCTCCTGTTATGCAATTACCAGGGCTCAAGCAGCCTGAAAAT 

GGT<^C£COT 

ACATTTTAGGAAAAAACCTGAGAAACTAAGTGCCTGTGGCTC 

TGTCTGAAGAAAGC^GAGGGGACTGCTCAGAGCTAAACAATCAGCTGCCGACTCTTCGT 
GGTCCTGGGAAGCAGAGCACAGGTGAGCAGTTACCATCCACTCAAGAAGCAAGGGAGAG 
TTTGGAAAAAAATACAAACCLAAAATAGTAGGAGTATGGCGTCTC 

ATGCTACTAAGTCAAGAAATAATGAGGATA^TGGAGAGGCACACTTGAAATGGAGATTG 

GCAGTAAAAGAAATGGCAGAGAAAGCAGTTTCCGGACAGCTCTTATTACCTCCTTGGAA 

TCCTCAGAGTAGTGCGCCTTTTGAGAGTAAGGTTGAAAATGAGAGCACTCCTTTGCCAC 

GGCCCCCAATTAGAGGTCCTGAGAGC\C\GAATGGCAGCACATTTTAGAATACCAGAGG 

GAAAATGATGAGCCCAAAGGAAATACGAAGTTTGGCAAA\TGGACAACAGTGACTG^ 

(^GAACAAGCACAGC^GATGGACAGGCCTCCAGCGCTTCACTGGTATTAGATACCCAT 

TCTTC^GAAACC^CGAGCAGCCAGAGCAGAATGAAGCCTCTC^ 

TCTGTGGGCACTGAGAAGTTCTACAGCACCTCAAGTCCC^TAGGAGACGACTTTGAAAG 
ATTCCAAGATTCTTTTGCCC^ACGTCAAGGCTATGTTGAAGAAAATTTCCAAATAAGAG 

AAATATTTGAAAAGAATGCTGAGATTTTGACC^^ 

GCTGAAGAC AAAC AAG AC GAAAC ATTAGGGG AGACGC C AAAGG AAC TGAAAG AGAAAAA 
CAC\TCACTGACAGACATTCAAGACTTGTCCAGCATCACCTATGATCAAGA.CGGCTATT 

TTAAGGAAA.CCTCATACAAAAC^CCCAAATTAAAACACGCACCAACTAGT 

CCGCTAAGCCCAGAGTCGATTTCTTC^GCTGCTAGTC\CTATGAAGACTGCCTTGAAAA 

TACCACATTTCATGTTAAAAGAGGATCTACATTTTGTTGGAATGGCCAAGAAGCTATGA 

GAACTTTGTCTGCCAAATTTACAACTGTCCGAGAGAGAGCTAAGAGCCTGGAATCACTT 

CTCGCTTCTTCTAAAAGCCTACCTGCCAAGCTGACTGACTCCAAGAGATTGTGTATGTT 
GAGTGAGACTGGCTCTTCTAACGTTTCTGCGGCATTTGTAACATCAACTCATGCTACCA
AGAGGAAGAGCCTACCCAGAGAACTGGCAGAAGCCACCTCTC^ACAGCATCTTGATGAG 
CTTCCACCACCAC<:TCAGGAGCTACTTGATGAAATTGAGCAACTGAAGCAGCAGCAGGT 
CTCATCCCTGGCGTCACATGAGAACACGGCACGTGATCTGAGTGTCACTAACAAGGATA 
AGAAGCATTTGGAAGAACAAGAAACCAACAGTAGTAAAGACAGCAGTTTTCTTTCC^GC 
AGAGAAATTCAGGATCTGGAAGATACAGAGAGAGCTCATTCTTCTCTTGATGAGGACCT 
GGAAAGATTCCTGCAGTCACCTGAGGAGAACACGGCACTGCTGGACCCTACCAAGGGCT 
CTACAAGGGAGAAAAAAAJ^CAAAGATCAAGACGTTGTTGAGCAGAAGAGAAAAAAGAAA 
GAAAGCATCAAGCCAGAGAGAAGGGAGTCAGAC^GCTCCCTAGGGACCTTGGAAGAAGA 
TGAACTAAAACCCTGTTTTTGGAAGCGACTGGGTTGGTCCGAACCTTCCAGGATAATTG 
TGCTGGATCAGAGCGACTTGTCAGACTGAttggaactggaccgngcaagcattgtggct 
gtggcctcctttcttctcttatctgcttcagttgcctcaaggacagcagtttcaatctg 
taactcacactttgttctgctgctactatgggcacaataatgtgtccctatcatg^tgfa 
gcatgcttaatcatttgttttaaatcaggtttctgaaaagtgcaaagtaaccatagtgg 
caaacattttgtgttcagaatggcttttgttttttctgatgtaaatttittgaaaccata 
actttg-ttatataataaagtgtttactttcaatgctaaaaa 

Figure 46 

SEQ ID NO.: 46 Spg85 encoded protein sequence

MQLTYPGSLPVIGEKJETVVQADDEPT 

SRLRLADLLIAEQEHSSNLRHPNLLQ^^ 

SQFPVLHMEVIVHLLLQVADALIYLHSRGFIHRSLSSYAVK 

QDSGAHEJNWRMPLPTQLYISIWAAPEWLQKAATV^ 

GSLVKETIALGNYLEADVRLPEPYYDI^ 

AQKTQPTESPRGQSYEPHPDVNICLGLTSEYQKDPPDLDIKELKEMGSQPHSPTDHSFL 
TVKPTLAPQTLDSSLSAQKPDNANVPSPPAACLAEEVR 

XTLGTDKEEEeLGTAASP.EGIDRPNQGD^^ 

I^FEDDDWQSDSLGSLNLPEPTREAKGXT^ 

LRKLEQEVEELEWAQKELDSQCSSLRDASLKFANAKFQPAVGP?SLAYLPP T 7MQLPGLX 
QPENGGTWLTLARS PGNEREFQEGHF SKXPEKLS AC GWKPFTQ VS EE SRGDC S ELNNQ L 
PTLRGPGKQSTGEQLPSTQEARESLEKI^QNSRS^ 

EYQRENDEPKGNTKFGKMDNSDCDKNKHSROT^ 

SCDTSVGTEKFYSTSSPIGDDFERFQDSFAQRQGYVEENFQIREIFEKNAEILTKPQFQ 

AIQCAEDKQDETLGETPKELKEKNTSLTD IQDL S S ITYDQDGYFKETSYKTPKLKHAPT 

SASTPLSPESISSAASHYEDCLEOTTFHVKRGSTFCWNGQEAMRTLSAKFTT^ 

LESLEASSKSLPAKLTDSKRLCMjSETGSSWSAAF 

HLDELPPPAQELLDEIEQLKQQQVSSIIASHENTARDLSVTOEC^ 

FLSSREIQDLEDTERAHSSLDEDLERFLQSPEENTALLDPTKGSTREKK^ 

RKKKESIKPERRESDSSLGTLEEDELKPCF/VKRLGWSEPSRIIVLDQSDLSD. 

Figure 47a 

SEQ ID NO.: 47 Spg87 cDNA sequence 

AGAGGTTTCAGGCTTAGGAAGAAAGGGGTAGTAG 

GAATAACTTAATAGATGTGTTACAGTTGGGAATATCGGATTTCCTCTGGCCCAGGTGTC 
CAGGTGAGCACTTCAGGCATTACTGAGGAATCTGTGTTGCTGTATTACTGTTCCGTGAT 
GTCAAACCCTGTTTTCCACACAGTATAACGCAACAGCATAGTGTATAGTATTATAGACC 
AGACAGCTTGGGCCTGGAATTATTGCTCCCCCAC^CCTATCCCTACCCACACCTCA.GGC 

AAAACAAGCAAAAA3CCCCAAATCTACTT^ 
AACACACAGAGGAAGTATACTTGGTTTTTATTCCGAGGAAGGTGGTAATTGTATCCTTT
CCCCTTTGGCTGGGGTCTGAACCTCTGCTAGTCTGAGAJ^TTGGTGCATAAGAAAATGGA 
GGAAGGGGAAAGGAGTTATTGCTCCAAAGAGGAAAGGATCAT^ 

TTCTTAGAATAGAJUVTAGGACCTTTGAGTAAATAAAAGTTTTACTTGAAGTGGGGGGAG 
GGATTAJ*CTCCTAAJ\CACAAGTTTTA^ 

TTTCTTATAAATCCTTCCTCTTTTTTTCCTTTTAAAACACTCTTTTCTTCAAACTCAT^ 
GTGGTTTTCTTTTTCTGTTCTATCTTGGAATAAGAAACTGCCCTGGGGAGGGAGTAGTA 
CCTGTTTATTAATAGTCAACTCAJ^CTATCCTATGTTATGTTCCTAGAATAAAAAATTGT 
TACATCTATTTCATTACCATGGTTTAAGGTAAGCTCACCTTCCTAAGGGTAGTTCCTCT 
GTCAGGAGAATGATCTATCTGCTTTGGAATCATTGCCCTTTGATGTGCAAAGGAGTTTG 
CCTCCTTTTAAGTGAAGTATCCAGGGTGTGGGGCTCCAGAAAGTATTACAGTGTTTTAA 
ATGTAGCATTTGCTTGTAATATTTGTA^ 

ATAATTGGGACACATTCGTCAATTCTGAGAACTTGTTATTTTTTTTCTATTTTCATAGT 
GTTTACTGTGAATTAAAATTTGTTATGCCACTGCCAAA AAATAAA GCTTACTTTGGAAC 
ACAAAAA 

Figure 48 

SEQ ID NO.: 48 Spg87 encoded protein sequence 

MIYLLWNHCPLMCKGVCLLLSEVSRW 
THSSILRTCYFFSIFIVFTVN 

Figure 49 

SEQ ID NO.: 49 Spg84 cDNA sequence

TTTTCCCAGGGAAGAGAAGGGAAGGAAAAGTCAGCATGTGTCACAGGATGATTATACGA 

TAGGAATACAAGGGCCCGGGTACAGTAAAAATCAACTAGGAAATAAAACACCCCAGGCA 

GGGGTGGGGGTCATCCAAGCCCCGGGGGTAGGGGGGCCTTAGTCTCCTGCAGTTCAGGA 

AAGAGATGGGGAAAGGACAGACA^GTACACCCTTTCCCACCCTCCCCAT 

TTTCTCAC^^ 

GTAACAATCTTGTGGCCTCTTTGACATAAGTTTCTTGCACCGGCA.TTAAGTTCCTGAGT 
GACTCACTGTACATTCATAGTCCCTTTTCTGGTGTCCAGATTCCAACTTTCTACATGGA 
AGGCCCCACAGTTGCTTCTCTCGACCTAAGGGCGACAGATTACATTGGTTTGATCCATC 
CAAAGGCGAATCACACCATGAAJVTAAATGGTCT^ 

CCCTGC^CAGGGGACTCTGTGCAGAAGGCAGGAGGAGAGCCAGGAACCCGAGGCTCTCT 

ATCTCCAGCTACAGAGTGAGGCCCTGTCTCA^ 

TTAGGGACAAGGAAAGAATGAGGTTCCTGGCTTCTTCAGAATTTTC^ 

AAGAGTTACAGAGATGCTCAAAGAAGTAAATGAGCCTAAGTTTTCTTACGAGGGAGAAG 

AGGGGCTGTCTGAAGGTCACTGTGGTCCAGAATGGGAGACACAGGAGGAGCAGGTAAGG 

ATAAGCCATC^TCCCAGAGGTTCGCTTTGAGAAGAGGACCGCACATTTAA.GTTTCAGAT 

GGTGTTGTCTGGTGCAGAGCTTTCTGTCA^ 

AACAAAAGTTTTGTAAACAATAAAACCCCAAAAGAACLATGAAGAC 

TTACACAACTNGGCCA.CCTAGCAJVTCACC^^ 

CCAGACGAATACAATTAACTAGGGGTGAGGGCAGCA*TTTGGGGGGA.GAGGGAGACAAGA 

AACCAAGATCAGACACACACACACACCACCC^ATGTGTTCTATTG^^ 

CAACACAGTGAGGTAGGTTTGTGGCGATCTCAAAGCATGTAAACGAAACTCCCCACCCC 

CTCTTTCCTTTAGCTATCAACCA.^^^ 

CCCATACTC\TGAGTCTCTCAGAAAGCCAAGATTGTGAAAGGCTAAGGAGTTGAGTC 

TGCCTCATGTTCCCATGCCTGGACAGCACAGTGTGGGAATAAGGGTACTGGTCTTAAGT 

TGCATTCTTACTGGGATAA.CTCAGTGGCAGTGTACTTCCTAGCACAAATGA^GCCCAAG 

ATTCCATCCCCAGCACVAAAAGCAATCAJLATAAACAAAAC 

GCCCACGCGTCC 
SEQ ID NO.: 50 hSPG1 cDNA sequence

TTCGCCTCTACGTTCCGCGCGGGAGCCCACGCGCGGTTCGTCCGGAACCCACAGACCAG 

AGACGCAGGTCCC^GCCTTTTCGGTGTCGGCGCCAGTTCCCGGAGGAGCGGACATGAG'T 

GAAAGCCAGGATGAAGTTCCTGATGAAGTTGAGAACCAGTTTATATTGCGTCTGCCTCT 

GGAACATGCTTGTACTGTCAGGAACCTAGCACGTTCTCAAAGTGTCAAGATGA^GGATA 

AACTAAAAATTGACTTATTGCCTGATGGGCGCC^TGCAGTTGTTGAAGTAGAAGATGTC 

CC^CTAGCTGCTAAGCTGGTTGACTTGCCTTGTGTTATTGAAAGCCTGAGAACGCTTGA 

TAAAAAAACCTTTTATAAAACAGCAGACATTTCTCAGATGCTTGTGTGCACTGCTGATG 

GTGATATCCACCTTTCTCCAGAAGAACCAGCTGCCTCTACTGATCCTAATATAGTCAGG 

AAAAAAGAAAGGGGGAGAGAAGAAAAATGTGTCTGGAAGCATGGCATTACGCCACCACT 

TAAGAATGTCAGAAAGAAAAGGTTCCGGAAAACACAAAAAAAGGTCCCTGATGTCAAAG 

AAATGGAAAAAAGCAGCTTTACTGAGTACATTGAATCTCCAGACGTGG>JW^TGAAG'lrA 

AAGAGACTGCTGCGTTCGGATGCTGAAGCCGTAAGTACCCGTTGGGAAGTCATTGCTGA 

AGATGGAACCAAGGAAATAGAAAGTCAAGGCTCCITCCCAGGATTTTTGATATCCTCGG 

GAATGAGCAGCCACAAGCAGGGTCATACCTCGTCAGAATATGATATGCTTCGGGAGATG 

TTCAGTGATTCTAGAAGTAACAATGATGATGATGAGGATGAGGATGATGAAGATGAGGA 

TGAGGATGAGGATGAAGATGAAGACAAAGAAGAGGAGGAGGAAGATTGTTCTGAAGAGT 

ATCTGGAAAGGCAGCTGCAGGCCGAGTTTATTGAATCTGGCCAGTATAGGGCAAATGAA 

GGTACCAGTTCAATAGTCATGGAAATT(l\GAAGa^GATTGAGAAAAAGGAGAAAAAGCT 

CCATAAGATTCAGAATAAAGCACAAAGACAGAAGGATCTCATCATGAAAGTGGAAAACC 

TGACACTCAAGAATCATTTTCAGTCTGTGCTGGAGC^GCTTGAGTTACAGGAAAAACAA 

AAAAATGAGAAGCTCATTTCCCTACA.GGAACAGTTGCAGCGTTTTCTGAAGAAGTGAGG 

AGAGCCATTGGCGTGGCCC\CAAACCTTGGATTCACCATCCAGACTGCAGATTGGATGA 

AACTGTGCA.CGTTTTTGTCCCTTCAGTTGCCTTATGTTGAATCAGTTATATTTATCTGT 

ACCTTCTTTGCTACTTAAAATATGCTCACAGTTTGTAGTCATGTAGAAAAGGCCAGTAT 



GGAATTATTATTATTAATTTAGAAATGGGGTCTTGTTATGTTGCCCAGGCTGAATTC^G 
ACTCCTGGGCTTAAGGGATCCTTCCGCCTCAGCCTC 

Figure 51

SEQ ID NO.: 51 hSPG1 encoded protein sequence

MSESQDEVPDEVENQFILRLPLEHACTVENIARSQSVT(MKDKLXIDLLPDGRKAWEVE 

DVPLAAKLVDLPCVIESLRTLDKKTFYKTADISQMLVCTADGDIHLSPEEPA.\STDPNI 

VRKXERGREEKCVWKHG IT? PLXNVRKKRFRKTQKXVPDVKEMEXS SFTEYIES PDVEN 

EVKRLLRSDAEAVSTRWEVIAEDGTKEISSOGSIPGFLISSGMSSKKQGHTSSEYDMLR 

EMFSpSRSNWDDDEDEDDEDEDEDEDEDEDKEEEEEDCSEEYLERQLQAEFIESGQYRA 

NEGTSSrVMEIQKQIEKKEKKIiKKIQNKAQRQKDLIMKVEMjTLKNOT 

KQKNEKLI SLQEQLQRFLKK 

Figure 52a 

SEQ ID NO.: 52 hSPG3a cDNA sequence

aaaggtgggagttctttccgggataattttgacaagaggagctgtcattatgaacatgg 
tgggtatgagcgcccgcctccacactgccaggagaatgatggaaacgtggagATGAGGG 
ATGTCCAC^AGGACCAACAACTAAGACAC^CTCCTTATAGCATCCGATGCGAAAGAAGA 
ATGA^TGGCATAGTGAAGACGA\.iTCCGTATTACCACGTGGAGAAATAGAAAACCTCC 



CAATTCCTTACGGGATAAAGTATGACAAGGC^TGGCTAATGAATTCAATCCAGAGCCAT 
TGO.GTGACCGCTTC\CTCCGGTTGATTTCCACTACGTCCGAAATCGGGCATGCTTCTT 
TGTCCAGGATGCTAGCGCTGCCTCCGCATTGAAGGATGTCAGTTATAAGATTTATGATG 
Figure 52b



ATGAGMCCAAAAGATATGTATATTTGTO^CATTCTACTGCGCCCTACTCTCTGAAG 
TGTCTCCCAGCAAGCTCTTGATCTCCAGAATCTCCGCTTTGACCCAGACTTGATGGGCC 

Gft^GAAATTTCCCTGAGCTGTTGTCTTTC 

CTATGGCTAGAAGGGAACCCGTTGTGCAGCAC CTTCTCGGAC CAGTCCGC CTATGTAAG 
CAGTGATTGTTGACACTGACAGCTCTGAGACAATGAAACCCTGCAAGGA 

SSctSgIcSta^tttagtcc^ 

TCACTCTG^^TCGA.CAGGGTCTCCTCGGTGCTTACCACGATGAGGCCTTCW^CCT 

^S^ccttcgaccccaagc^ctcagccccgagcagcttgtccaagtactttgag 

S^cgtcacattgtggactccct^gtgcgttgcccaaaactcagc^c-acctc^ 
^^ccmcctcgtggacgtgtggtgccagacggaaaggatgctctgcttttctgtcaat 

cttcattgctocccctggcagcagttccagtctgtgcatcgtgaatgacgagctgot 

GAGGCCTTCAAGCAAATCTCCTAAaaggagccctccgatgtcttccttgtcttcgtcca 
caccctctttgtttcctcttttcaccagcctaaggcctggctgaccaggaagccaacgt 
taacttgcaggccacgtgacataaccacccaaagagccagttgctccgtgtattcgccc 
cactcatgatcaccattttattttcataacaaagagtgacgttacacgttaaaaa 

Figure 53 



F SLAIPFDPKDSAPS SLCKYFESSKM^TLKDEYI^GELL^TKM ^VDSL 
DLSSILVDWCQTEI^CFSVWFXEVEGQSQ 

^aSS^sipvstlssssepslsqeqq^-'saqsgmklhwsqkclqdn 

EWNYTRAGQAFTMLQTEGKIPAEAFKQIS r ^ ^ 

SEQ ID NO.: 54 hSPG3a genomic DNA sequence




TGGGTATGAGCGCCCGCCTTCACACTGCCAtwAi*^ 

ATGTCCACAAGGACCAACAACTAAGAC^GTAAGTGACCAGGCAGCCTGGT^TCC^GTA 

ggcctcagtgagtggagcatgtataggao&ctttcttaga 
a^gtgaagacgaaatccgtattaccacgtggagaaatagaaaacctccggagagaaaa 
ATGAGTCAGAACACACAGGATGGATACACAAGGi^CTGGTTTAAGGTCACAGTGAGTAT
CTTGGTGGGGTCTGCATTAGGTGGACTATTCTGGAACCTGATAGAAGGAAGACCACTTA 
AAACACCCTAAGTTGATTATTTGGAGGAGAGCTTCGGAATGGGGGGAJU^GGGAGTTTAG 
GGCATCTATATTTGGCACAAAi^TAAGAAATCATGTC^GC 

TAAGCTAAGTAGAAGGTTGGAAAGAAAGAAAAAAAAAACCCAGCTGGTTGGTTCTC 

TCCATCCTTAGTTCCACGTTGTCTCTCCCTTCCCTCCTATTTCTTCTTTTTACCCTTAG 

ATTCCTTACGGGATAAAGTATGACAAGGCATGGCTAATGAATTCAATCCAGAGCCATTG 

CAGTGACCGCTTCACTCCGGTTGATGTAAGAGAGGATGGTGAAGCCAGATGAGTGGGCA 

TGGGGGACGGGGAGAGGCCTGGCTCA-GCAGGGGCCATTGGCCTCTGACGCTGTTGCTCT 

TGCCTTCACTCCCTGTAGTTCCACTACGTCCGAJ^TCGGGCATGCTTCTTTGTCCAGGA 

TGCTAGCGCTGCCTCCGCATTGAAGGA.TGTCAGTTATAA.GATTTATGATGATGAGAA.ee 

AAAAGGTGTGTGCCGAGGGCATGCCCTGTACTTAGTCTCTGGGCAGGAGGACAGGCCAG 

GGGGCTGGTCATCCTCTTTGGGATTAGAGGTCCTGGTACTTACCACCCTGCCTTdCTGC 

AGATATGTATATTTGTCAATCATTCT^ 

CCAGGCCAAATGGAGATGCTAAAGGTAATACAGACTCAAGGATCATTGTATC 

CCTGGACCCACCTCTTCTTCCCCTGGCCCCCTCTTTCCCTGTCACCACACCACCACCAC 

CACCACCATCACCAGAGCCCCAGAGCCTCTGTCTTCATCTCTATCTCTGCAGCTGACCA 

TGAACAAACGGTACAATGTCTCCCAGC^GCTCTTGATCTCCAGAATCTCCGCTTTGAC 

CCAGGTAAGGCTGACAGCAGCAATTCTAAGACAAGCGGGGGCAGAGAGGTCTGCCTGGG 

AGGGAGACTTAGGAJITGGCAATTTACAGAGGGGTTGGGGCTGGCTCTGGTCCAGCCA-GG 

GCCCTCCCAGCCTTCCGATTCCCTTCTCTTGGCTTCTTCAAGACTTGATGGGCCGTGAC 

ATTGATATAATC CTGAATC GAAGAAAC TGCATGGCTGC CAC C CTGAAGATC ATTGAAAG 

AAATTTCCCTGAGGTGAAGCCTTAGGCTCAGTGCTGGTATTTAGTTAGAGGGGTGGAAG 

GGATAAGGTGGAGGGCAGATTTGTCTCTGAGGCCCAAGATAGTAGCCGCCACTCTAACT 

CTTCTTGACCCAAAGCTGTTGTCTTTGAACTTGTGCAACAACAAGCTGTACCAGCTGGA 

TGGCCTTTCTGACATTACAGAGAAGGCTCCC^^^ 

- - ATAAGGTGAGAJ^GGGGGAGCCAGATC^^ 

ATGGCAACA.GCCAGGCAGTGGCACCTGTGGGTGACTATGAGGGCCGGGGGAATTCAGGG 

CCCAGGGTCCTGGGTGTCTCTCTTTCCCTGGCCCTCCTTCTCCAGTTTCCTCCCCATCT 

TTCTTAGCTGGAGTCGGCGTGGGAGTTGGGCAAGGTGAAAGGGCTGAAGCTCGAAGAGC 

TATGGCTAGAAGGGAACCCGTTGTGCAGCACCTTCTCGGACCAGTCCGCCTATGTAAGG 

TCAGTGGCAACCCCGGTCACCCTTCCTGGGCACCTTTGCTCCCTGGGTGA.CTGAGCTGT 

GTCTGAAGGTGCCCTTCTGCAGGAAGAAGC^GCCTTGGTCCTCTGGGAGGACCACAGAC 

CTCCCTTCCTACTCTCTCTCTCTCTCTCTCTCTCCTCTCTCTCTGTCACTCACTCATCT 

GTGCTTAGAGGTCTCCTTTCCTTCCTCTGACATGGTCCCCTTTTCACCTGCTCTGGGGT 

GTGTTTCCCGCCTGTCTCCACCAAGCCTCCTCCAGTGTGCCCTCTGTGAGTGTGCTCCA 

GGAAGTGGGGCTCCCCCACCTCCCCAGGACC\GCAGTATTCAGATGCTGGTGCCCTGGA 

CCGAGAAGAGTCCTTTAGTCCGGGGCTTCATGCTGAGACAGGCCTTCCTGCCTCCATGC 

TGCGATGGGGCTTCCCTCCCCTGTCCCAAGAGGGGTTCTCCTTCTCTTGCTCCAAAAAG 

GTCCCCCCACCTGTCCTGGGCTGGCATGGGGGCTTCCCTGCCCCATACTGAGGGCTGGG 

GCCCCGGGTGGCTGCTGTCATGCCATCTCTTCCTCTGGCCCAATGCCAGGAGCTGGGCC 

CTCCTTGGGGAGAAACCTGGGTTTCCTACGTCAAGGGK 

GAGGCAGGAGTGGAJIGGCAGAGGGCAGAGGAGGTTGAGAAGACAG 

(^GC<!ACTGCCCCACCTTCCCTCCTTCTCCACCTCACATCGTCCTGCCTGGGCCCTCAG 

AGGAGCCCTGGGTA-GCCCGA.GATGGGTAGCATTCCTCGGTCAAGGCGTCAGCACA.GAGG 

GGCACAGGA.GTCAGGGACCATCAGAAGAGAATGCAGTGGTCTGGAGAGGGGGTCCCCAG 

ACTCTGAACCCCATGCTGAGCTGGGGCCTGACTCTTCACTCCTCCCGGGAGAAGGTCTC 

CTGTCCCCGTGGCTGCTCTGTTTCCCATGCCCAGCTCAGACTGAGCTCACACAGGTGAG 
GAAGGCTCCAGCTGCATCCAGTGGGCCCCAGCCCAACAGGCGCATGTTTTCCTTTCCTG

CCTCGGTCCCCAGGGCCAGCCAGGGAGCAGTGAGGGAAAGGGCTGCAGCAGGGGAGCCC 

TTTTCTCCTCTTCTCCTCCCTGAACTCCACCTCCGCAGTAGAGAGTCTTTCCCTTCCCT 

TTGC^TTGCATCCTGTTTCTCCCTTGTCTCTCCTCTCCTATGTCTC\CCCATCCGTCCC 

TCCCCTACCTTCACCCC6TTTCTGTTGTTGTCCCCCCTGCCTTCCGCTTCCTGCCTCCT 

GAGTCCGGCCTCACTCACCTCCGTGTCCCAGCATCCTGGGCCATCCCTAAGGGCTGACC 

TGGTCTTGGCCAGGGCCTGGTCAGGCAGGTTGATGGACAGCCAGTGAGGTGGCAGAGCC 

CTGGGCTCCCACCCCATTTCCTGCTCCCTGCAGAGCCTTCCATGGTGACTTGGGCAAAG 

GGGAGGGAGGGAGAGGAAGAAAGCCCTGGAGCCTGGGCTCCCAGTGCTGCTTCTTGTAG 

CACTGGAGAAAAGGGAGTCAGGACAGTCTAGATGGAAGCTAACCAGGAGGAAGGA.GAGG 

GAGGAGTGTGAGAGGGAGTGGGAGAGAGACTGTGCAACCCTGAACTGTCAGTCACTTCA 

TTCAATTTTTGGTTTTGGACAGTGCCATCCGGGATTGTTTCCCCAAGTTGTTACGCCTG 

GTAAGTAT(lTATAATACCGTCATCATTGTCTCCTCTTACTCAAGAAAAGGACCTClfAGG 

CCTGCCCTCAJ^GTCCTTTGGGTCTTGCCTAGATTACATGCTTGTATCAGACCCTCATCC 

ATTTTACAGGCATGGATTCCTGAAAAAGACAAGAATATTCTCCCTGGAAAATGTGTCCC 

CCCGCCGCCACCCCCCCCCCACACACACATTTGCATGTGATAGTAGAGATGTCCAGCAC 

CCCATGAGAAAGCCACCCACTGGAATTTCCAGGGCCATTTCCCACCTAGCCTCTGATGC 

TTGTCTCCCTGGGCATGTTCATCTTATCGATCATCCAGCTCCATCTCCTTGGTCTCCAC 

TGCTGACTTCCTCTTTCCTTCTCCAGGACGGCCGAGAGTTATCCGCACCAGTGA.TTGTT 

GACATTGACAGCTCTGAGACAATGAAACCCTGCAAGGTGAGGAAGAAGGACCAA.GCAAG 

ATTTGGGTTGCTGTAAGGGAGGCTTTGTCCACCGCATAGATCCAAATTGTCTTTTGATT 

TCAGGAAAACTTTACTGGATCTGAGACCCTAAAGCATTTAGTCCTGCAATTCCTGCLAGC 

AGTGAGTATCCCTGGGACCATGAGGAAGGGGAGGGCTGAGACAGGCTGGGCCACCCGTG 

CAGCCTGGGAGTTTTCAAGTCTCATCTGGGGCCCAGGCCACAGAGATAGCCTATCCTCA 

CTGCTTCCCCAC^GGTATTACTCGATCTATGACTCTGGAGATCGACAGGGTCTCCTCGG 

TGCTTACCACGATGAGGCCTGCTTCTCCTTGGCTATTCCCTTCGACCCCAAGGA 

CCCCGTGAGTATCACGGCTCAGACCCTGCTCTGGGGCTGTGTGTCTCCCCAGCAGACAC 

AGGCCAACTCCTGGAAATGCCCACACTGGCCGGACCACCCACTCCTGCTCCTCTTTTTC 

TCCTAGGAGCAGCTTGTGCAAGTACTTTGAGGATAGCAGGAATATGAAAACACTCAAGG 

ACCCCTGTAAGTGTGTGATGGGGAAGAGTGGGCAAGGTAAGGGGGTGTGATGGGAACAA 

TCACAGGGGCCAAGGACCAGGATGTGGTAGCCCCCCGCCCTGCCCCGCCCACCCTGCCA 

TTCCTTGCTTCTCCTCTCCTCTACAGACCTGAAGGGGGAACTGCTGAGGCGCACAAAAC 

GTGACATTGTGGACTCCCTCAGTGCGTTGCCCAAAACTCAGCATGACCTCAGCTCCATC 

CTGGTGGACGTGTGGTGCCAGACGGTGAGCACCTGCTTCCTCCCTTGGGCAGGCCCAGA 

GAGCCAGAGGTGGGTAGGAGGTTAAGGAGGATCCTGAGCACCTGAGCGCTTCCTTTTCA 

GGAAAGGATGCTCTGCTTTTCTGTCAATGGGGTTTTCAAGGAAGGTGAGTGTCTGTA.TA 

GTCCCCTCCCCAGATCCCCCACTGCTCCCTCCCCCTGGCTGGGCTCCCTCTCAGAACTC 

CCCCAGCTTCCCTGCTTTCGTTCCTTTCCTTTCCTTCCTCTTCTTTCTTCCGTGTTTTC 

CCACCCCCACTCTGCCTTCAAACCACCCTGATCTGACCTAGGTCCATGCCTGTCTGCCC 

TGCACAGCTCAGGCGTGCGTTAAGGACACAGACTGTGGAGTTTGACAGCTCTCATCCCA 

GGTCCTTACTCTGTGAACTTGTGCTGGTTACTTAACCCTTCAGTTTCCTCATTTGCAAA 

ATGGGGCTAATAATCTATCTCTTGGGCTACTGTGAAATAGGAATTAAGTGAACTTGTGG 

TTTTCCCAGGGCCCCACACATGATAGGGGCCCTGTCACTGGGAGCGGATTGTTCCTGTT 

GCTGCCCTCCCCATTCCTGCCACATCCGCCTGACTCCAAGTGAACAATTGTCCTGGTCT 

GCCCCCTCCCCCCCTTCTGTGTGAGTGTCAAGCAATA.CTCTGACTGGGGATCACCGTGT 

GAGCATGTTAAAGCCTGTGCAACTCTAAGGTGGTGGTGTTTGTTGTCTTTGAAGTGGAA 

GGACAGTCTCAGGGTTCTGTTCTCGCCTTCACCCGGACCTTCATTGCTACCCCTGGCAG 

CAGTTCCAGGTTAGTGCTGTGTTGTGGGTGGGAGCACCCATCCAAGCTTGGGGCCAGTG 
TTGTGGAAATGTGGTGGGTGCAGTC

GGAAGATCCAAGGAGCAGTCTAGCCTAGTGTTTAAGAGTGTGGGCCCTGGAGTCAGAAT 

ACAGATTCTGTTCCTCACTCCAACACTCACAAACTGTGTGACCTTGATCAACTTATTCG 

ACCACTCTGTGATTCAGTGCCTTTTTCTGAAAATTGGAATAAGACTATCCACTTCCTTG 

GGCTGTTGTACAGGGTAAATGCGTTGGGGTTGGATCTAAAAATCTAGTGAAGCTGGTAG 

ACAGTCCCTCCAAAGGTGGACTCTGTGGGAGGGTTAGAGGGTACC\GCCAAAAAATCTG 

GGAGGCAGGC\CAGTTAGGGATATGGAAGGAATTTGGTTGTTGAGTGGCAGTGGTTAAG 

AAGGATCCTGTTGTTGGGGGTGCGGAGTTATCTACTTGTCCAGTTTGAGGGTGCATTTT 

TCTTTCCTCCAGTCTGTGCATCGTGAATGACGAGCTGTTTGTGAGGGATGCCAGCCCCC 

AAGAGACTCAGAGTGCCTTCTCC\TCCCAGTGTCCAC\CTCTCCTCCAGCTCTGAGCCC 

TCCCTCTCCCAGGAGCAGCAGGAAATGGTGCAGGCTTTCTCTGCCCAGTCTGGGATGAA 

ACTGGAGTGGTCTCAGAAGTGAGTGCTGGGAGTACATGGGGATGGGGGCTGTTGGGACA 

TCAGAGGAATGAGTAATGGAAA.CTC\C^TGCAATTTGGAAAAATAACTATCTGGGTA^ 

TTGCTTCCAAAAAAAGTGGGTCCATGAAAAAGGTATCATACTTTTATACTGGTATATGT 

AAATATTTTTTTAAATGGCATAATGCCCAAATGACATTACCTTCCATTTGTAAAATCTG 

AAAGAATCLAACCACAATAAACTACTGATA^^ 

ATACAAAAATCATTTGTACTTCTATGTAGTC 

AAGAAAATAAGTCCATCTACAGTAGCATCGAAAAGAAGAA^^ 

AAGAGAAGAAGCGCAAATCTTGTACTCTGAAATCTGCAi^^^ 

AAGAA.GACCTAAATACGTGGAAAGGCATCCCACGTTCATAGATTGGAAGACTTAATATC 
ATTACGATGGCAGTACCACCCAGAACAATCTACAGATTCATTGCAGTCCCTGACAGAAT 
CCCAACTGACTTCTTTGCAGAAATTGACAAGGTAATCCCAAAATTCATGTGGAAATGCA 
GTGGACCCCAAACAGCCAAAACCATCTTGAAAGAGAAGAACAACGTTAGAAGACTCACA 
CTTCCCGATTTCAGAACTTGCTACAAAACTACATTAATCAAGACTGTATGGTACCGACA 
TAGGAACAGACGTGGGAATCAATGGAATATAATTGAGAGTCCACLA.GATAATCTCACGTA 
TTTATGTCCAGTTGATTATCATTCAGGGTGC TGAAAAA ATCCAATGGAGAAAA AAAATA 
GTCTTCTTAACAAATGGTGCT^ 

GACCCCTAACTCACATCACGTGCAAACACTGGCAGAAAATGGATCAATGACCTAAAATA 
AGAGCCAGAACTGTAAAACACTGTAAGTGTACATCTTCATTACCTTGAATTAGGCAACC 
ATTTCTTACATATGAAACCAAAAGCACAA^ 

TTCATCTAAATTAAAAGCTTTTGTGCATCAGCAGACACTATCAAGAAAGCGGAAAACCG 
ACTGGTGGGAAC^GGGAAAATATTTGCAAATCACATCGCCGACAAGAAGAACCCTTAC 
AACTCAACAACAAACAGACAAGCCACCGAATT^^^ 

TGTTTCTCCAAAGGAGATATACAA^TGACCAAGAAGCACGTGAAAATCTTCTCAACATC 
GTTAGTCATTAGGGAAACGCATATCGAAACCACAGTGAGTTACCACTTCATACCCACTA 
CGCTAGCTTTAGTCC\TAAAAGGAAACATGACAAA 

GGAAATCTC^TGTATTACTACTGGGAACATAAAGTGGAGCAGTTGCTGGCAAAAAGATT 
TTGGCAGTTCCTCAAAATCTTAAACATGGAGTTACCACATGATCCAGTAATCCCACTCC 
TAAGTGTATACCAAAAGAAATGAAA^TATATGCCCATTCAACA^CTTGCACATGAATTT 
CCGTAGTGGCATTATTCCAAATAGCCAAAAAATGGA^ 

GATGAATGGATA^TGTGGTACATCCATACGGTGGAATATTATTAGAATATTATTCATCC 
ACAAAAAAGGATGTAGTTGTGATATATGCTATGACGTC 

GTGCTAAGTGGGAGCACCCAGTCACAAAAAGCCACATAATTATATGATTCCATTCATAC 
GAAGTGTTC AGAATAGC CAGATCCGTAGAG AC CGAAAGC AGAGT AGTGGTTGC CAAG AT 
CTGGGGAAGAGGGAGAACAGGGAGTGATCTCTAAC^GTTAAGGAGTTTCTTTTTGAGGT 
GATAAAAACAGTTTGGAATTAGATAGGTGTGATGGTTGCAC^TCTTGTGAATAGACTT 
AAAkGCACTGAATTGTACACCTTAAAAI^ 

AATAGAAAAGAAACGTATTATTGAATTTCCACTTGTTATTTCTTGAACATCTTTCTTTA 
Figure 54e

TCAATATGTATTAAGC 

TAGTGGGCATTAA.TGTCAGGGATGGGCGTTTTGGTTTTCCCCAGGCCTTTTTTCATTGT 

TACAATAGTGCTCATATTGGTACL^TGTGACCCAGCAA^V^GGTAGCATAGA.TTAAGGGT 

GGC^TTGCATAGTCAGCGTGTCTGTCCTGGGCTAGTAATGGAGAGCACCTGTTCTTCTC 

CC^CCCCAGGTGCCTTCAGGACAATGAGTGGAACTACACTAGAGCTGGCCAGGCCTTCA 

CTATGCTCCAGGTGAGGTCTGGGAATCAAGTGGGTAAAAGACAGCTGTCTCTGGGTCGT 

CAGGAGGGCCAAGAAGATGGAGGCCAGGTAGTGTGGGGATGGAACCCAGTGCACCTGGC 

TCTACTAACATCCCAACTCCTTTTCTTTACTTTCTCTAGACCGAGGGCAAGATCCCCGC' 

AGAGGCCTTCAJ^GCAAATCTCCTAAJiJVGGAGCCCTCCGATGTCTTCTTTGTC 

ACATCCTCTTTGTTTCCTCTTTTC^CCAGCCTAAGGCCTGGCTGACCAGGAAGCCAACG 

TTAACTTGCAGGCCACGTGACATAACCACCCAAAGAGCCAGTTGCTCTGTGTATTCGCC 

CCACTCATGATCA-CCATTTTATTTTCATAATAAAGAGTGArnTTACArRTT 

SEQ ID NO.: 55 hSPG3b cDNA sequence Figure 55

CCAGC\CGAAGGAGACCCAGAGTGCCTTCTCCATCCCAGTGCCTGCACCCTCCTCCAGC 

TCCTTGCCTACCCTCTCCCAGAAGCAGCAGGAAATGGTGGAGACTGTCTCCACCCAGTC 

TGGGATGAAACTTGAGCAGTCTCAGAAGTGCCTTCAGGACAGTGAGTAGAACTACACCA 

AAGCTGACCAGGTTTTCACTATTCTCCAGACCGAAGGCAAGATCTCAGTGGAGGCCTTC 

AAGC^AATCCCCTAAAAGGAGCCCTTCGATGTCTTCTTTGTCTTCATTCACATCCTCTT 

TGTTTCCTCTTTTTACCAGCCTAAGGCCGTGCCC^GGACTGGGGTTGGCAGCCTGGCTC 

ACCGGAAAGCCAAAGTTAACTTGCAGGCCGGGTAACATAACC 

SEQ ID NO.: 56 hSPG5 cDNA sequence Figure 56a
ATGCCCAGTGATGCa\AAGACAGTGTTAATGGTGACCTTTTGTTAJVATTGG 
TAAAAATATTTTAAGTGGTCTTAATGCT 
_GCACAGTCACTACTT_CAJ^TC 

ATGGGAGAACAGAGTAGTACTGCAGGCTTAAATGAGGTTTTGCAATTTGAGAAGAGTTC 

AGATAATGTTAATTCAGAAATAAAATCGACACCATCTAATTCTGCCTCCTCCTCAGAAG 

TTGTCCCTGGTGATCGTGCTGTTCTTACTAATGGTTTGGATACCCCTTGCTTTAAAACT 

TCTGTTAATGA.TTCACAATCTTGGGCTCACAACATGGGCTCTGAGGACTA.TGACTGTAT 

ACCTCCCAATAAAGTTACCATGGCAGGGCAATGTAAGGACCAAGGTAATTTTTCCTTCC 

CAATTTCTGTGTCAAATGTAGTGTCAGAGGTTGAGAACCAAAACCACAGTGAGGAGAAG 

GCTCAGAGAGCCCAACAGGAGTCCGGTAATGCTTATACAAAAGAGTACAGTAGTCACAT 

TTTTC^GGACTCGCAGTCTTCTGATTTAAAJ^ACAATTTATCAGACTGGTTGCCAAACGT 

CTACAGTTTTTCCACTCAAAAAGAAAGTAAGC^TTGATGAATACCTTCAAAATACTGGA 

AAGATGAAAAACTTCGCTGACCTGGAAGACAGTTCCAAACATGAAGAAAAGCAAACTTC 

ATGGAAAGAAATTGATAATGATTTCACTAATGAAACAAAAATCAGTCCA^ 

AGATTGTTTTGCACCAAGAATACAAAGAGAGTC^ 

TGTGATAAAATATTAATTACTCAAGA.GTTAGAAJITAACAAAATCTTCTACATCTACCAT 

AAAGGATAAGGATGAACTAGATCATCTAGCATTGGAATGGCAA-ATTACTCCAAGTTTTG 

AGAGCCTGTCACAAJ^GCATCCTCAGCACTCTGTGGAG^ 

AGTTTAGCCATTGCTCAAAAGCTAATGGAACTG 

TGCTAGCATTATAACTGAAGCTTTC^ 

TGTTC^TTGATA.CAGTTATTTCATCTTATAACATAGAAACAGCTCATGACAGTTCAAAT 
TGCAGCATAACTAGAGAACATATATGTGTCC^^^ 

ATTAGAGAACATTCAGA.GAGA.CTATAAAGAAACTGCTTATGTTGAAGATAGGGGTCAGG 

ATCACAATCTGTTCTGTAJVTTC^^ 

AAAAAACAAJ^CAGATAGAGAAAJ\CC\?iAATGA(^ 
Figure 56b

AGAAAACAACATAGAGAACATATA^ 

ATTTCAGC\ATATAGATGAAAAGGAGGACAAi^TTAC 

TCTGAAGaATTTTCTACTAAATTTiW^ 

AGCAACTGCATTA.TTAGAGA.GTGAJ\GAAGATA.CCATTAGTGCCGTGAAA-CAAAAJIGATA 

CTGAAAATACTGGAAGAAGTGTAGAGCATTTGGCTTCCACGACATTTCCCAAAACTGCA 

AGTTCTTCAGTGTGTGTAGCCTCAAATGCTGC^ATACAGATAGCTAGTGCTACTATGCC 
TGCATTAAGCCTAAATAATGACGATCA^ 

CTGAAAGTCCAGATTTTGGTTTGTTAGTAAAAC^TAGGGTTTCTGATTGTGAAJ^TTGAT 
ACGGATAAAJ^TAAATCAC\AGAATCATTT^ 

TCAGAGCATTGAATTGGAAAGTGAAATTGAAATAGAATTAGAAGATTGTGATGATGCTT 

TTATATTTCAACAAGATACACATAGCCATGAAAACATGCTTTGTGAAGAATTTGTGA.ee 

TCATATAAGGCTCTGAAGTCTCGTATC^GTTGGGAAGGTCTGTTAGCACTTGATAACGG 

GGAGATGGAAGTTTTGGAAAGCACCACAGGAAGGGAGAATAGTGATCAGCATTATTC'PA 

AGGAAAGTAACTATTTTTATTCCTCTACACAAA^ 

ATTTTACTTCCAGATCTACAAATTAAAATTACTAAT 

GACAGCTGACTCCCTTGCATTGAAAGATAGTTTTTGCACACATGTAACTGAAGCCACAA 

AACCGGAAATAAATAAGGAAGATGGAGAAATTCTAGGATTTGACATTTATTCCCAGCCT 

TTTGGTGAAAATGCAGATTATCCATGTGAAGATAAAGTTGATAATATAAGGCAA^ 

AGGGCC1AGTGAGTAACTCTGAAATCTCCCTTTCTTTTGACTTGAGTCGTAATACAGATG 

TGAATCATACGTCTGAAAATCAGAACAGTGAATCTTTGTTTACTGAACCTTCTAATGTC 

ACAACAATAGATGATGGAAGCAGATGTTTCTTTACAAAATCAAAAACTGACTATAATGA 

TACOVAAAATAAAAAGGAGGTAGAATCAAGAATTAGCAAAAGGAAGCTACATATATCTT 

CGAGGGATCAGAACATACCACATAJU^GATTTAAGACGAC^ 

AGGAGGCTAACCAGTCAJ^GACTCATCTGAGTGTTTCTCTTCATTATCCCAAGGACGAAT 

TAAAACATTTTCACAGTCAGAAAAGCACATTAAGAGTGTCCTAAATATCCTAAGTGATG 
AAGGATCTTTATGT^^ 

TTAAAAAAAGCTCATAGAAGAGTTCACACATCTTT 

AGAAAGAAAGGGCCCA.TTACCAAJ\ATCATATGCAGTAATATGCAATAJ^TT 
GTTGTGACCTTCAAGGTTATAGTTCTGTC^ 
TTTOCGTC1AAAAAGAAAATATGACAAACGGAGAAA 
TTCTAAATCATTAACCGATGTGTCAAAGCACAAGTCTTATA 

AATGCCTTTCTAGGAAAAGTATGGCTAGC^GTGTCTCAAAAAGTCACCCCACCACCAGT 

CACATGGGAGAATTTTGTAATCAAGAACATCCTGAATCACAGTTGCCTGTATCCTCCAC 

ATCCCAAAGTACAAGTCAGTCAGTTTATTATAATAGCAGTGTAAGCAATCCAAGTTTAT 

CAGAAGAACATCAGCCCTTTTCTGGAAAAACTGCATATCTGTTTTCCCCAGACCACTCA 

GATGAGAAJ^CTAATAGAAAAAGAAAATCAAATTGATACAG^ 

TAAATATGAAAAGCTTGAAAAACATTCAGC^^ 

AAAAC\GTTGTGACGCTAATGAAGTAATAAATGAAAGTAATTCTGTATCTO 
ATAAAAGAAAACATAAATTCTAGTACAGGCAACGATO 

CAC^AAGGCGAAAACTGACGTACTTATATCAGTCTTAGATTCAAJ^TGTGAJVGCACTTTO 

TAAATGATCTCTACCAJ^CAJIGGTAACCT^ 

GTAAAGTGGACAGATCCTATTGAGAGACCCAAA 

TATGGGCCCATTAAACCTAACTTTGATAGCAAGTAAAAAGTACAGTATTCCTCAGGTA.T 
CAGCCGCTGCAGTGACAGATAGTGAGGGAGAJ^TCTTCAAAATCTTACTTGGA.TAAGCAG 
AGAATTCTTACTGTAGATTCTTTTGCAGCATCCAGTACTGTACCACACTGTGAGCAGAG 
CTGTAGAGAAAAAGAGCTTCTAAAGACAGAACAGTGCTCTTCAGGTAATTGCCTCCATA 
CAGATGGGAATGAAACAJ^TGTCACTGAGAATTATGAGTTGGATGTAGCATCAGGAACT 
GAAGAAGATAAAAGTTATGGGGAAAATATAGTGGAATTATCTTCCAGTGATAGTTCTCT 
Figure 56c

GCTTTTAAG AG AT AATGTAAAAGGCT C CT CTTC^GAAAC ATGT ATTGTG AAG AAAG AC A 

CTGAGGAC^GAATAJ^CGTCX^AGTTAA^ 

AAAAGAAGCATGACTGAAGGATC^tOT^ 

GATCTCAGAAGAATCCTGCTTAAATGAGAAAATTATTACAACTAACTTGATTGATTCCC 
ATCTGAGCACTAAAAATACTACCACTGAGTCAGTCCCTTTGAAGAACACAGTTTCTAAT 
CCGCTTAACAJIAAGAGAGAAGAAGGGGGAAATTAAAGTTAGT 

CTTGACATTAC^TTCAGAAATAGCCTATATTTCCAAACCAGGAATTCTAGGAGTTAATC 

ATACGCCT ATTTTAC CTGCCC ACTCTGAAAC CTGTAAAGTCCCTACTCTTCTGAAG£ AA 

CCTGCGTCATACGTGAGTGATTTTAAAGAAAAACATTGCTCAGCTAATCATACGGCCCT 

TATAGCTAATCTATCTCAAATTTTGCAGAGGGCAGATGAAGCATCATCTTTGCAGATT 

TACAGGAAGAAJVCTAAGGTTTGTCTAAATATTCTCCCTTTATTTGTGGAAGCTTTTGAA 

AGAAAGCAAG AATGTTC AGTTGAACAAATC C TG ATTTC AAGAGAACTGTTGGT AGACCA 

AAACC TGTGG AATAATTGCAAACACACATTAAAAC C ATGTGCTGTTGACAC TTTGGTAG 

AACTTCAAATGATGATGGAftACAATTCAATTCATTGAAAACAAAA^ 

GGTGAACCAACATTGCGAAGCTTGCTTTGGTATGATGAAACACTGTATGCTGAGCTTCT 

TGGAAAACCACGTGGATTTCAACAGCAGTCTAATTTCTATCCTGGTTTCCAAGGAAG 

TAAAATATAATGCATTCTGTGAGTTACAGACTTACCATGATCAATTAGTTGAATTGCTT 

GAAGAAACAAAAAGGGAAAA-GAATTC^TACTATC 

TAATGAATGTGAAGCCATAATGGAGCATTGTTCCGATTGCTTTGATTTTTCTCTTTCTG 
TTCC\TTTACCTGTGGAGTTAACTTTGGAGATAGTTTAGAAGACCTGGAAATCTTAAGA 
AAAAGTACTTTAAAGTTGATCAATGTATGTGGGGACTCTCCTAAAGTTCATTCGTATCC 
AGGAAAACAGGACCATCTGTGGATTATCATAGAAATGATCTCCTCAAAGGTTAATTTTA 
TTAAGAACAACGAGGCAGTACGTGTTAAAATATCTCTTTATGGTCTGGAACATATCTTT 
TTTGATGCTGCAAAAAATCTTGTTTGGAAAGAGAGA^ 

CTCACAAAAGAAGGACGAAGAAAGGCTACTCAGAGTGAATAAATGTGCCTTTTCTAAGT 
TGC\GAAGATATATGATACTTTGTCTAAAGATTTAAACAATGAACCAATTTCCCCTATT 

_GGGCTTGAGGAGGATAGTATAATTGGTTGGAGAAAGTCAGATCL\TCCAATAAACGAAGC^ 

AACAATTAGCAT AG AAAATTC TAAATTT A^.C AGTAATTTGCTTGC ACACC CAGATATTT 
GTTGTATTAGTGAGATATTGGATCAGGCTGAATTTGCAGACCTTAAAAAATTACAGGAT 
CTCAC C TTG AGATGTACAGATCLAC TT AG AAATTTT AAAAAAATAC TTTC AGATGC TAC A 
AGATAATAAC ATGG ATAATATTTTTATC\C AGAAGAAAATG T TTTAGACGTGG TG AT AA 
AC CACAGC CATG AGGCTATC ATTTTAAAGC CTG AAGC TATTG AAATGTATATTGAAATC 
GTCATGGTCTCAGAAACAATTCACTTTCTTAAAAACTCAATAGCAAAGAA^ 
ACAGAGGTTTCG AGGTATGCTTTGGTTTGATTTGTCACTTCTTC C TGAGC TGGTTC AGT 
GC CAAGAAAAAATGGCTTCTTTTTC ATTT CTTAAAG ATAAC TC AAC AGATGTTTG C T T 
TGGAAAGTGATAGAGACTGCTGTTTCCGAACTTAAGAAAGATCTGGATATTATCTGCAA 
ATATJ^TGAAGCTGTTAATTGCTCATATGCTAT^ 

AACTTTOAGAAATAAAAAAGCTTCTGAAGAAGTCCAAGTATTTTATTTCCA.CA 
GACTTTGTGCCATATATAGCATCCATAAATTATC 

C AACT ACAATC AATTTTC TACAC TGCTG AAG AA.TGT AATGTC TGC C C CT AGGAAAGATT 

TAGGAAAAATGGCCCAC\?TAGGAAAGTCL\TGAAA\CGATTGAACATA 

TGTACT AAAAATG CTGAACTAAC CATTTC CTTTTTC C TATGCC A AATGCTGT AT AA.CAG 

AAGGAAGATTTTACAGCTGAAGAGAAA^GAAAAA^TGAATATTCATATTGTAA^ 

GGGAAAATAACAATAAATTTAGTATTTCTACGATGTTGCCCCCAGTATCAGAGTGCATA

AACAAAAACATCTCAAATTCCTCTAAAAAACGACCGAGCACTGTAGACAAATGTGAAGA

CTCTCAGGAACAACAGCAAGATACTACTGT^

TGAAAGATGTCAC\AAA\TC\ACAGAGAAAAGGCAACATTCAAG(^ 

GGATCTCATCCCAAJ^GCGAAJLAC^^^ 
Figure 56d

AAATCATTTAACGCCAAAAAAGGTTGAAATGCAAAGATCACTACCTGGCTCACTTTTAC

CCTTAGAGAACCCAAAAGACACTTGCGCATCAAAGTCGGAAAGCAAAATAGACTTAACT

GTTTCATCTGATCACTTCAGTGGACAACAGGAAAATTTAAATAGCATGAAGAAAAGAAA

TGTGAACTTCAGTGCTGCTGAAACAAAAAGTGATAAGAAAGATTGTGCTGCTTTTCCAA

TTTGTGACCAAAAAAGTGTACATGGCACATTTTCACCAGACCATGGGACGCTTTTGCAG

AAATTTCTTAAAAATTCCCCAGATCCCACCCAAAAATCCTGCCTTTCTGATATAAACCC

AGAAACTGATGTTTCTCTTGTGCCTGATGCGTCGGTGCTCTCAAAGCCAATTTTCTGTT 

TTGTGAAAGATGTCCATCCTGATCTAGAAATGAATGACACAGTCTTTGAACTTCAAGAT 

AATGATATAGTAAATTCATCTATTAAAAATTCCTCATGCATGACTTCTCCAGAACCCAT

CTGTATCCAGAACAAAATTCCTACTCTGCAGATAAACAAACTACAGCCTACAGAAACTG

AGTCAGAGGACAAATACATGAAGGATAC^TTGAATCCCAATACTG

GCATCTGGGCATATAACCCTTAATGTGAATCAAGGAGCAGAGTACTCTCTTTCTGAACA 

ACAGAATGACAAAAATTCAAAAGTCCTAATGCAGA^ 

TTCCACAGTCTGCATGTAACCCAACATATAATTCTTCTGAGCATTTATTTGGAACTTCA

TATCCATACTCTGCTTGGTGTGTTTATCAGTACAGCAACAGCAATGGCAATGCCATTAC

CCAGACATACCAAGGGATAACATCATATGAAGTACAGCCATCTCCTTCTGGGCTGTTGA

CCACAGTTGCAAGTACTGCCCAGGGCACACATTCTAATCTTCTGTACTCTCAATATTTT

ACTTATTTTGCGGGGGAGCCACAAGCAAATGGCTTTGTGCCAGTGAATGGGTATTTTCA

ATCTCAAATACCTGCTTCTAATTTTCGGCAGCCAATTTTTTCACAATATGCTTCTCATC 

AGCCATTACCACAAGCTACATACCCTTACCTTCCTAATCGATTTGTGCCTCCAGAAGTT 

CCTTGGGTTTATGCTCCATGGCACCAAGAATCCTTTCATCCAGGACACTGA 

Figure 57a

SEQ ID NO.: 57 hSPG5 encoded protein sequence

MPSDAKDSWGDLLLNWTSIJCNILSGLNASFPLKWrGSSTVTTSKS IKDPRLMRREES 
MGEQSSTAGLNEVLQFEKS SDNVNSEIKSTP SNSAS S SEWPGDRAVLTNGLDTPCFKT 
SV1QDSQSWAHNMGSEDYDCIPPNKVTMAGQCKDQGOT 

AQRAQQESGNAYTKEYSSHIFQDSQSSDLKT^ - - 

KMKNFADLEDSSKHEEKQTSWKEID^ 

CDKILITQELEITKSSTSTIKDKDELDKIJUliEWQITPSFESLSQKHPQHSVEYEGNIHT 
SLMAQKLMELKiGKIItfQNYASIITEA^ 
CSITREHJO/KRKNENEPVSLENIQRDYKE^ 
KKQTDRENQNEAKENSASCVEmiENIYGDKXQDSHTNE^ 

SEEFSTKFNLICREDNAVSAATALLESSEDTISAVKQKDTENTGRSVEHLASTTFPKTA

SSSVCVASllAAIQIASATM^ 

TDKNKSQESFHQSINENLVLQSIELESEI^^ 

SYKALKS RI SWEGLLALDNGEMEVLESTTGRENSDQHYSKZSNYFYS STQNNETELTS ? 
ILLPDLQIXITNIFRPGFSPTADSLALCT 

FGENADYPCEDKVDNIRQESGPVSNSEISLSFDLSRNTDVNETSENQNSESLFTEPSNV 
TTIDDGSRCFFTKSKTDYISIDTK^ 

RRLTSQDSSECFSSLSQGRIKTFSQSEKHIKSVLNILSDEASLCKSKCLSRKLDKAVVK 

LKXAERRVHTSLQL!TK7\/GE 

FSSKRKYTDKRRKKRA?KAI)ISKSLTH^ 

HMGEFCNQEHPESQLPVSSTSQSTSQSVYYNSSVSNPSLSEEHQPFSGKTAYLFSPDHS 
DEKLIEKENQIDTAFLSSTSKYEKLEKr^ 

IKSNINSSTGNDCDATCIGHTKAKTDVLISVLDSNVKHFLNDLYQQGNLILSDCKRNLS
VKWTDPIERPKQSIITGNFLMGPLNLTLIASKXYSIPQVSAAAVTDSEGESSKSYLDKQ
RILTVDSFAAJdSTVPHCEQSCREKZLLKTEQCSS 

EEDKSYGENIVEL S S SDS SLLLRDNVKGSSSETC rVK2QTEDRITWKVKQ AEKAKDSVY 
Figure 57b
KRSMTEGSTVNTEYKNQKNQI^ 

PLNKREKKGEIKVSKDSQSDLTLHSEIAYISKPGILGVNHTPILPA^ 

PASYVSDFKEXHCSAJEiTAilANLSQILQ 

RKQECSV^QILISRELLVDQNLWI^ 

GEPTLRSLLWYDETLYAELLGKPRGFQQQSNTC 

EETKREKNSYYVFLKYKRQVNECEAIlffi 

KSTLKLINVCGDSPKVHSYPGKQDHLWIIIEMISSKVNFIKNNEAVRVKISLYGLEHIF
FDAAKNLVWKERTQSFSKKYSQKKDEERLLRVNKCAFSKLQKIYDTLSKDLNNEPISPI
GLEEDTIIASRKSDHPINEATISIENSKFNSNLLAHPDICCISEILDQAEFADLKKLQD
LTLRCTDHLEILKKYFQMLQDI^^ 
VWSETIHFLKNSIAKKLDKQRFRGMLWFD^ 

WKVIETAVSELKKDLDIICKYNEAVNCSYAIHLLSRELQELSEIKKLLKKSKYFISTYI

DFVPYIASINYGSTVTELEYNYNQFSTLLKNVMSAPRKDLG

CTKNAELTISFFLCQMLYNRRKILQLXRK^ 

NKNISNSSKXRPSTVDKCEDSQEQQQIXET^ 

GSHPKSENKIVPSSCDSLKRMILTPKKVEM^ 

VSSDHFSGQQENLNSMKKENVOTSAAETKSDKKD 

KFLKNSPDPTQKSCLSDINPETDVSLVPDASV^ 

NDIVNSSIKNSSCMTSPEPICIQNKIPTLQINKLQPTETESEDKYMKDTLNPNTVHTFG
ASGHITLNWQGAEYSLSEQQNDKNSKVLMQNAATYWELP^ 

YPYSAWCVYQYSNSNGNAITQTYQGITSYEVQPSPSGLLTTVASTAQGTHSNLLYSQYF 
TYFAGEPQANGFVPVNGYFQSQIPASNFRQPIFSQYASHQPLPQATYPYLPNRFVPPEV
PWVYAPWHQESFHPGH

Figure 58a 

SEQ ID NO.: 58 hSPG5 genomic DNA sequence

— -ATGCCCAGTGATGCCAAAGA.a\GTGCTAATGGTGACCT_CT 
TAAAAATATTTTAJIGTCWTC 

GCACAGTCACTACTTCAAAATCCATCAAAGACCCAAGACTGATGAGGAGAGAAGAAAGT 
ATGGGAGAACAGAGTAGTACTGCAGGCTTAAATGAGGTTTTGCAATTTGAGAAGAGTTC
AGATAATGTTAATTCAGAAATAAAATCGACACCATCTAATTCTGCCTCCTCCTCAGAAG
TTGTCCCTGGTGATCGTGCTGTTCTTACTAATGGTTTGGATACCCCTTGCTTTAAAACT 
TCTGTTAATGATTCACAATCTTGGGCTCACAACATGGGCTCTGAGGACTATGACTGTAT 

ACCTCCCAATAJ^GTTACCATGGC^ 

a^TTTCTGTGTCAAATGTAGTGTCAGAGGTTGAGAACCAAAACCACAGTGAGGAGAAG 

GCTCAGAGAGCCCAACAGGAGTCCGGTAATGCTTATACAAAAGAGTACAGTAGTCACAT

TTTTCLAGGACTCGCAGTCTTCTGATTTAAAAJ^CAATTTATCL^GACTGGTTGCCAAACGT 

CTACAGTTTTTCCACTCAAAAJIGAAAGT^^ 

AAGATGAAAAJICTTCGCTGACCTGGAAGACAGTTCC 

ATGGAAAGAJ\ATTGATAATGATTTC\CT^ 

ACATTGTTTTGCACCAAGAATAC^GAGAGTGAGAGTCATAAT^ 

TGTGATAAAATATTAATTACTCAAGAGT^^ 

AAAGGATAAGGATGAACTAGATCATCTAGC^T^GGAATGGCAAATTAC 
AGAGCCTGTCACAAAAGCATCCTCAGCACTCTGTGGAGTATGAGGGTAACATTCATACA

AGTTTAGCCATTGCTCAAjy^GCTAATGGAAC^^ 

TGCTAGCATTATAACTGAAGCTTTCCCGAAACCAAAA.GACATACCCC^GGCCAAAGA^ 
TGTTCATTGATACAGTTATTTCATCTTATAACL^TAGAJ^CAGCTCATGACAGTTCAAAT 

TGC^GCATAACTAGAGAACATATATGTGTCCA^ 

ATTAGAGAACATTCAGAGAGACTATAAAGAAACTGCTTATGTTGAAGATAGGGGTCAGG 
Figure 58b

ATCACAATCTGTTCTGTAATTCACAGTTAAGCAATGATATATGGCTGAATGTTAATTTC 

AAAAAACAAACAGAJTAGAGAAAACCAAAJVTGAGGCTAA^ 

AGAJ\AACAACATAGAGAACATATATGGA.GACAJ^AAGCAGGA 

ATTTCAGCAJITATAGATGAAAAGGAGGACAAAAATTACC^CAATATAG 

TCTGAAGAATTTTCTACTAAATTTAACTTGATTTGCAGAGAAGATAATGCA:GTGTCAGC 

AGCAACTGCATTATTAGAGAGTGAAGAAGATACCATTAGTGCCGTGAA^AAAAAGATA 

CTGAWUITACTGGAAGAAGTGTAGA 

AGTTCTTCAGTGTGTGTAGCCTCAAATGCTGCAATACAGATAGCTAGTGCTACTATGCC 

TGCATTAAGCCTAAATAATGACGATCACCAGATATACCAGTTTAAAGi^^ 

CTGAJ\AGTCCAGATTTTGGTTTGTTAGTAAA^ 

ACGGATAAAAATAJVATCACAAGAATCATTTCATCAATCAATAAATGAGAACTTA.GTTCT 
TCAGAGCATTGAATTGGAAAGTGAAATTGAAATAGAJVTTAGAJ\GATTGTGATGATGCTT 
TTATATTTCAACAAGATACACATAGCCATGAAAACATGCTTTGTGAJ^GAJ\TTTGTGAC^ 
TCATATAAGGCTCTGAAGTCTCGTATCAGTTGGGAAGGTCTGTTAGCACTTGA.TAACGG 
GGAGATGGAAGTTTTGGAAAGCACCACAGGAAGGGAGAATAGTGATCAGCATTATTCTA 
AGGAAAGTAACTATTTTTATTCCTCTACACAAAACAATGAAACAGAACTTACCAGCCCA 
ATTTTACTTCCAGATCTACAAJ^TTAAAJ^TTACTAATATATTTAGGCCAGGATTCAGCCC 
GACAGCTGACTCCCTTGCATTGAAAGATAGTTTTTGCACACATGTAACTGAAGCCACAA 
AAC CGGAAATAAA.T AAGG AAGATGGAGAAATTC TAGGATTTGAC ATTT ATTC C C AGC C T 
TTTGGTGAAAATGCAGATTATCCATGTGAJ^GATAAAGTTGATAATATAAGGCAAGAATC 
AGGGCCAGTGAGTAACTCTGAAATCTCCCTTTCTTTTGACTTGAGTCGTAATACAGATG 
TGAATCATACGTCTGAAAATCAGAACAGTGAATCTTTGTTTACTGAACCTTCTAATGTC 
ACAACAATAGATGATGGAJ^GCAGATGTTTCTTTACAA^TCAAAAACTGACTATAATGA 
TACCAAAAJVTAAAAAGGAGGTAGAATCAAGA^ 

CCAGGGATCAGAACATACCACATAAAGATTTAAGACGACATAAAATTTATGGGA.GAAAG 

AGGAGGCTAACCAGTCAAGACTCATCTGAGTGTTTCTCTTGATTATCCCAAGGA.CGAAT 

TAAAACATTTTCAC AGTC AGAAAAGCACATT AAGAGTGTC C TAAATATC CTAAGTGATG 

AAGCATCTTTATGTAAAAGCAAATGTCTTTCCAGAAAACTAGACAAAGCAGTTGTTCAC 

TTAAAAAAAGCTCATAGAAGAGTTCACACATCTTTGCAGCTTATAJ^CTAAAGTAGGAGA 

AGAAAGAAAGGGCCCATTACCAAAATCATATGCAGTAATATGCAATAATTTCTGGGAAA 

GTTGTGACCTTCAAGGTTATAGTTCTGTGTCTCAAAGAAAATATTATTCTACTAAGCAT 

TTTTCGTCAAAAAGAJ^AJVTATGAa^ 

TTCTAAATCATTAACCCATGTGTCAAAGCACAAGTCT^ 

AATGCCTTTCTAGGAAAAGTATGGCTAGCAGTGTCTCAAAAAGTCACCCCACCACCAGT 

CACATGGGAGAATTTTGTAATCAAGAACATCCTGAATCACAGTTGCCTGTA.TCCTCCAC 

ATCCCAAAGTACAAGTCAGTCAGTTTATTATAATAGCAGTGTAAGCAATCCAAGTTTAT 

CAGAAGAACATCAGCCCTTTTCTGGAAJU^^^ 

GATGAGAJ\ACTAJVTAGAAAAAGAAAJYTC^ 

TAAJVTATGAJ\AAGCTTGAAAAJ\C^^ 

AAAJUZAGTTGTGACGCTAATGAAGTAJVTA-^^ 

ATAAAAGAAJ\ACATAAATTCTAGTAC-^^^ 

CACAA^GGCGAAAJICTGACGTACTTATAT^^ 

TAAATGATCTCTACCAACAAGGTAACCTTATT^ 

GTAAAGTGGACAGATCCTATTGAGAGACCG*J\ACAJ^^^ 

TATGGGCCCATTAAJ^CCTAACTTTGATAGCAAGTAAJLAAGTACAGTATTCCTCAGGTAT 
CAGCCGCTGCAGTGA.CAGATAGTGAGGGAGAJ^TCTTCAAAATCTTACTTGGATAAGCAG 
AGAJITTCTTACTGTAGATTCTTTTGCAGCATCCAGTACTGTACCACACTGTGAGCAGAG 
CTGTAGAGAJ^AAAGAGCTTCTAAAGACAGAACAGTGCTCT^ 
Figure 58c

CAGATGGGAATGi^J^ACAA^ 

GAAGAAGATAAAAGTTATGGGGAAAATATAGTGGAATTATCTTCCAGTGATAGTTCTCT 
GCTTTTAAGA.GATAATGTAAJ\AGGCTCCTCTTC^^ 
CTGAGGACAGAATAACGTGGAAAGTTAAACAAGCGGAAJ^ 
AAAAGAAGCATGACTGAAGGATCAACTGTTAA^ 

GATCTCAGAAGAJ^TCCTGCTTAJ^ATGAGAAAATTATTACAACTAACTTGA.TTGATTCCC 

ATCTGAGCACTAJ^J\ATA.CTACCACTGAGTCAGTCCCTTTGAAGAJ\CACAGTTTCTAAT 

CCGCTTAACAAAAGAGAGAAGAAGGGGGAJ^ATTAAAGTTAGTAAAGA^ 

CTTGACATTACATTCAGAJVATAGCCTATATTTCCAAACCAGGAATTCTAGGAGTTAA.TC 

ATACGCCTATTTTACCTGCCCACTCTGAAACCTGTAAAGTCCCTACTCTTCTGAAGAAA 

CCTGCGTCATACGTGAGTGATTTTAAAGAAAAACATTGCTCAGCTAATCATACGGCCCT 

TATAGCTAATCTATCTCAAATTTTGCAGAGGGCAGATGAAGCATCATCTTTGCAGATTC

TACAGGAAGAAACTAAGGTTTGTCTAAATATTCTCCCTTTATTTGTGGAAGCTTTTGAA 

AGAAAGCAAGAATGTTCAGTTGAACAAATCCTGATTTCAAGAGAACTGTTGGTAGACCA

AAACCTGTGGAATAATTGCAAACACACATTAAAACCATGTGCTGTTGACACTTTGGTAG 

AACTTCAAATGATGATGGAAACAATTCAATTCATTGAAAACAAAAAAAGGCACTTAGAA

GGTGAACCAACATTGCGAAGCTTGCTTTGGTATGATGAAACACTGTATGCTGAGCTTCT 

TGGAAAACCACGTGGATTTCAACAGCAGTCTAATTTCTATCCTGGTTTCCAAGGAAGAT

TAAAATATAATGCATTCTGTGAGTTACAGACTTACCATGATCAATTAGTTGAATTGCTT 

GAAGAAACAAAAAGGGAAAAGAATTCATACTATGTATTCTTAAAGTACAAACGA 

TAATGAATGTGAAGCCATAATGGAGCATTGTTCCGATTGCTTTGATTTTTCTCTTTCTG 

TTCCATTTACCTGTGGAGTTAACTTTGGAGATAGTTTAGAAGACCTGGAAATCTTAAGA

AAAAGTACTTTAAAGTTGATCAATGTATGTGGGGACTCTCCTAAAGTTCATTCGTATCC

AGGAAAACAGGACCATCTGTGGATTATCATAGAAATGATCTCCTCAAAGGTTAATTTTA 

TTAAGAACAACGAGGCAGTACGTGTTAAAATATCTCTTTATGGTCTGGAACATATCTTT 

TTTGATGCTGCAAAi^ 

CTCACAAAAGAAGGACGAAGAAAGGCTACTCAGAGTGAATAAATGTGCCTTTTCTAAGT

TGCAGAAGATATATGATACTTTGTCTAAAGATTTAAACAATGAACCAATTTCCCCTATT 

GGGCTTGAGGAGGATACTATAATTGCTTCCAGAAAGTCAGATCATCCAATAAACGAAGC 

AACAATTAGCATAGAAAATTCTAAATTTAACAGTAATTTGCTTGCACACCCAGATATTT

GTTGTATTAGTGAGATATTGGATCAGGCTGAATTTGCAGACCTTAAAAAATTACAGGAT 

CTCACCTTGAGATGTACAGATCACTTAGAAATTTTAAAAAAATACTTTCAGATGCTACA

AGATAATAACATGGATAATATTTTTATCACAGAAGAAAATGTTTTAGACGTGGTGATAA

ACCACAGCCATGAGGCTATCATTTTAAAGCCTGAAGCTATTGAAATGTATATTGAAATC 

GTCATGGTCTCAGAAJICAATTCACTTTCTTA*^ 

AGAGAGGTTTCGAGGTATGCTTTGGTTTGATTTGTCACTTCTTCCTGAGCTGGTTCAGT

GCCAAGAAAAAATGGCTTCTTTTTCATTTCTTAAAGATAACTCAACAGATGTTTGCTT

TGGAAAGTGATAGAGACTGCTGTTTCCGAACTTAAGAAAGATCTGGATATTATCTGCAA 

ATATAATGAAGCTGTTAATTGCTCATATGCTATTCATTTGCTCTCAAGAGAACTTCAAG 

AACTTTCAGAAATAAAAAAGCTTCTGAAGAAGTCCAAGTATTTTATTTCCACA

GACTTTGTGCCATATATAGCATCCATAAATTATGGAAGCACTGTGACAGAGTTAGAATA

CAACTACAATCAATTTTCTACACTGCTGAAGAATGTAATGTCTGCCCCTAGGAAAGATT

TAGGAAAJ^JVTGGCCCACAJTTAGGAAAGTCAJr^^ 

TGTACTAAAAATGCTGAACTAACCATTTCCTTTTTCCTATGCCAAATGCTGTATAACAG

AAGGAJVGATTTTACAGCTGAAGAGAAAAGAAAJLAATGAATA^ 

GGGAAAATAACAATAAATTTAGTATTTCTACGATGTTGCCCCCAGTATCAGAGTGCATA
AACAAAAACATCTCAAATTCCTCTAAAAAACGACCGAGCACTGTAGACAAATGTGAAGA

CTCTCAGGAACAACAGCAAGATACTACTGTTTCCAGTTGTAAAAAGCTAAAGGTATGTA 
TGTTTTAAAACAAAACT^

ACAAGTAGTTTGCAGAATTCTAAAAGTTAAGAATGGTAAATTGTCTGGCAAAATGAATT 
TACTAACTATAATATTGATTTAAJ^CAAT^^ 

TATGTATAGGTGATACCTTGCAAJITGTCTACTTTTTAAACGTAAATGTTTTAJICTTAGA 
AAACATTTTTTGGAAGGACGTGGATTTTAAAAGCTTCTTAAGAAGGAGTTCAATATTAT 
GAACACTGAGTGAGTGACCAATATTTATTGAGTAGCTTTTCTGTATAGCAAGCCTATGC 
CCCGTGTAGTGAATATTAJUVAA.GTGGCTAACAAAGCCTGTCTTCTTGACCATTATCATC 
CCAATGGAGAAJ^TAGGACAGATATTTTTCAAGAGATAATTATCAAGGAGTTAAAAGTCC 
TATATTAAATACATTTTAACAAATTGTCTAAAG.CATTTAATATGTACTTGGCACTGATT 
TATTTTATACACAACCCCTTATGATGTATTTCCTATTATCTTACTTOVAGATAAGGAAA 
GTGGGACACAGAGGAAATGACTATCCTGGGGTCACAGAATTTAGTAAATGGGAGCACCC 
AGATCTAAACCAGGCAGTCTGGCCTCCAGAGCCCTTATTGACAGTTGTCTCAGCACTGC 
TCTAAGAGGTTTCTCTTACAGCTGTTCA^GACTTCTTAGTTCA.GAJ^GTTTAGAJ^AAGAA 
ACCTATATGCAGCTGGGTGCGATAGCTC\CGCCTGTAATCCCGGCACTTTGGGAGGCCA 
AGGTGGGCAGACTGCTTGAGTCCAGGAGTTCAAGACCAGCCTGGGCAACATGGTGAGAC 
CTCATCTCTACTAAAAATAACAAAAAATTAGCCAGACATGGTGACATACACTTGTAGTT 
CC^CTACTTGGAAGGTTGA-AGCAAGAGGATTGCCTGAGCACAGGGGCGGAGGTTGCAG 
TGAGCCAGGATTACACCACTGTACTCCAAGCTGGGAGGACAGAGTAAGACTCTGTTTCA 
AAAAAAATTTATATATATAA.TTTTTAAJITTAAGTCTGGAACAATTTAACTTAGTG 
AGAATAATCTATGGATGGAGAAGGTTATTTCAGATTATGGAATATCTTAAATTAGACCT 
AAGGAGTTTGACCTTCATTCTGTACACATTGAAGTGTACTGTCATATGAAATTCGTTTT 
TCTAATGATTTAACAGATAGATTCTGAGTATATAAGTCATATATGTCTTCTGTAGAAGT 
ATATATAGTAATAAGTAGTAGTACTGTATACAATACTACTTACAGTAATAAGTCAGCCA. 
TGGCTTAGAAAAAGGTGTTAGAAJ^GGAiVAATTCATATTGACAGACATGACTTAAAGAAT 
CGATGGGGCTTAGCAACTGGTTTACAAGAAAGGACATAAAAGAGGGAGGGAATAGTCAT 
. _AAATGGACT.TC^GGTTAGTTTTAAGGGCC^^^ - 
GC^CTTTGGGAGGCTGAGGCAGATGGATCACCTGAGGTCAGGA.GTTCGAGACCA.GCCTG 
ACCAACATAGTGAATCCCCA.TCTCTACTAAAAATACAAATATCAGCTGGGTGTGGTGGT 
GGGCACCTTTAGTCCCAGCTACTCTGGAGGCTGAGGCAGGAGAATCGCTTGCACCCAGT 
AGGCGGAGGTTGCAGTGAGCCAATATCGCATC^CTGCACTTCAGCCTGGGTGACAGAAT 
GAGACTCCATCTCAAAAAAAAAAAAGATGTGAGGATAAGAAGAATGTTACAAATTTATT 
TTTTTTAJ^AGTTACTCCAAACATATATGAAATATGGGAAGTCTAGGAGAGTCACTGCTT 
TCTGCAGGGAGGTGATAATTAATTAGTTTAACTTTAGATGATGGCA.GAACAAGCAA.TTA 
AAAATATCTAATATTGAAATAATAATTATTTTTATTATAGTTTGTCATCACTAATGAAG 
ATTTTCTTTGTATCTTTTAAACACAGGTAGACATGAAAGATGTCACAAAAATCAACAGA^ 
GAAAi^3GCAACATTCAAJGCATCCAAGGTA^ 

AATTGAJ\CAAAATTGGGCAAATACTAAAC\ACTAGAA-TAACACAATTATTTATAT 
TCATTGGATCAJTACATGTAACTTAGGGGATGAGAG^ 

TTGAAATTATAC^.GTTTCTTAAATTCAGCCTAAGGAGTTTGAACTTCATTCTGTAAGTA 
TTAAAGGCTAAAGTCATGTGAAATTTGCTTTTTCAGTGATG 

TGAGTATAGTTAGAAGTAAGTCAGCCAAGGCTTAGAAAAGGGTGATTGTTTCACAGAAT 

AAGAAAGAAAAJITTTATATCTTTACAAGA^ 

CTTAGTAAGTGGTTTACCAAAAAAAGACTTCTAAAATC 

GGACAAGTGATAA.GATTACTTTTTTTTTAACTTGTGCATTTTGAJ^ 

AAAACTGGCATTTTAATGAGAACCTACATAGTCATCAATTCCCATACGATACTTTGA 

TCTCAGAJ^AGTATAATTTTGATTTTATCGCATTTCTGGTGATGTTTCAGTGA.GGCTGAA 

TTC\GTGAAATATTTGGCTTTTAGTTTGTTGAGTGGGATATGTGAJ\.TTCCAGGCCAAAG 

AAGAATTTCTCTTGAGTACAGAGAAATAGTACTACTTTTCCCTTCTCAACATAATATTT 
CCTCATACAATTTAAGAAT

GTGGCTCATGCCTGTAATCCCTGCACTTTGGGAGGCCAAGGCGGGCGGATCACCTGAGG 
TCAGGAGTTGGAGATCAGCCTGACCAACATGGTGAAJ\.CACTATCTCTACTAJ^J^.TATG 
GAAACTAGCCGGGCATAGTGGCGGGTGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGC 
AGGAGAATCATTTGAACTTGGGAGGCGGAGGTTGCAGTGAGCTGAGATGGTGCTGCTGC 
ACTCCAGCCTGGGCAACAGA.GTGAGACTCTGTGTCAAAAAAAAATTCATATATATATAT 
ATATATATATATATATATATATATATATATATATATGTTTTTATATATACACACATACA 
CATACATATGTGTGTATGTGTGTGTGTATATATGTGTGTGTGTGTGTGTGTGTATATAT 
ATAATTACCTCAGGGACCTATATTAACTGTCTGGCTTTAGACAATTCCCATTTGGATGC 
TTCTCTCAGCTGCTTGAGACTGTCTGAATTTAGCTAATAATTTCAAGGCTATTAAATTG 
GGC AAGAAATTTTGGAGATCTGCTTTTCCTATTCTCA^^ 

TTAGTCTGTTATAACAGAGAGGGAAGAAATCTTTGGTGGGAAGGCAGGGGTTACApCTG 
GTAGTTTACCAGTTATTTATAGCTATTTACAAGTAATGAAGATCATCAGGCAGGAGGGA 
TATGAATTAAAAAACTATACTGAACAATTGACTAGTGATAGTTTTCTACTTTTAJ\A-AGC 
CTCC^TTAGAAAATGTCTAATGCACAAAATAGTTTATTACAATATTGGAAATATATTTA 
AAATGTAGAGCATATCATCTTCAGTAGGAAAAGTATCTAAATCCAAACCACGGCAGTCA 
AACTAGGAAAATGTTAACTTGTATAGTGCCAATAGAATGGGAAAACGTAAAGCTTAAGA 
ACTCTTCCCCTGGATAAAAATTTCAAATATATATTTCC^TTAAAATTTTATGACCCTAT 
ATATTTAAATTCATTTTTATGTATGTGTTGCACCTAATAGGACTACAGGATCTCA.TCCC 
AAAAGCGAAAACAAAATAGTACCAAGTTCATGTGACA.GTCTC 

GCCAAAAAAGGTTGAAATGCAAAGATCACTACCTGGCTCACTTTTACCCTTAGAGAACC 

CAAAAGACACTTGCGCATCAAAGTCGGAAAGCAAAATAGACTTAACTGTTTCATCTGAT 

CACTTCAGTGGACAACAGGAAAATTTAAATAGCATGAAGAAAAGAAATGTGA^ 

TGCTGCTGAAACAAAAAGTGATAAGAAAGATTGTGCTGCTTTTGCAATTTGTGACCAAA 

AAAGTGTACATGGCACATTTTCACCAGACCATGGGACGCTTTTGCAGAAATTTCTTAAA 



TTCTCTTGTGCCTGATGCGTCGGTGCTCTCAAAGCCAATTTTCTGTTTTGTGAAAGATG 
TCCATCCTGATCTAGAAATGAATGACACAGTCTTTGAACTTCAAGATAATGATATAGTA
AATTCATCTATTAAAAATTCCTCATGCATGACTTCTCCAGAACCCATCTGTATCCAGAA
CAAAATTCCTACTCTGCAGATAAACAAACTACAGCCTACAGAAACTGAGTCAGAGGACA 
AATAGATGAAGGATAC^TTGAATCCCAATACTGTGCATACTTTTGGAGCATCTGGGCAT 
ATAACCCTTAATGTGAATCAAGGAGCAGAGTACTCTCTTTCTGAACAACAGAATGACAA 
AAATTCAAAAGTCCTAATGCAGAATGCTGCCACATATTGGAATGAACTTCCACAGTCTG 
CATGTAACCCAACATATAATTCTTCTGAGCATTTATTTGGAACTTCATATCCATACTCT 
GCTTGGTGTGTTTATCAGTACAGCAACAGCAATGGCAATGCCATTACCCAGACATACCA
AGGGATAACATCATATGAAGTACAGCCATCTCCTTCTGGGCTGTTGACCACAGTTGCAA 
GTACTGCCCAGGGCACACATTCTAATCTTCTGTACTCTCAATATTTTACTTATTTTGCG
GGGGAGCCACAAGCAAATGGCTTTGTGCCAGTGAATGGGTATTTTCAATCTCAAATACC
TGCTTCTAATTTTCGGCAGCCAATTTTTTCACAATATGCTTCTCATCAGCCATTACCAC
AAGCTACATACCCTTACCTTCCTAATGGATTTGTGCCTCCAGAAGTTCCTTGGGTTTAT 
GGTGAGTTTCACATTTTAATGCCTGCTTTATTGAGTTGTTACTTTTAA 

Figure 59a

SEQ ID NO.: 59 hSPG15 cDNA sequence

CGGGGCAGCCTAGGCCGGGCGAGGGCCATGCTGAGCCTCGCAGCCAAGCTGGTGGCCTT 
CTTCTGGAGGACGGCGGACACCCCTAGGGAGGAAGCCGGGCAGCTGGAGCCCGAGCTCG 
CGGAAGGTGACACTAAGCTGAAAACTGTACGC^GTGTCGTGACAAGGTACTGCAGCGAT 
TATGGCATGATTGATGATATGATCTACTTCTCCAGTGATGCTGTGACTAGCAGAGTGCT 

TCTGAATGTTGGACAGGAAGTGATTGCAGTTGTGG^^ 



WO 01/66752 






PCT/US01/07371 



Figure 59b 

TGAAAGCWC\GGGTAGAAGCTGTCTCTGATAAGTGGGAAGACGAC\GCAGAAACCAT 
GGGAGTCCCTCAGACTGCGGCCCCCGAGTGTTGATTGGCTGTGTGACTTCCCTGGTGGA 
GGGCGC^GGCTGTATCAGTCAGACCACCTACTTCTCTCTGGAGAGTGTGTGCGAAGGCT 
TCGAGCCCTGCAAGGGAGACTGGGTGGAGGCTGAGTACCGGATCCGGCCTGGCACGTGG 
AGCAGCGAAGCCACCTCAGTGAAGCCACTGAGATACAAGCGCGTGGACAAGGTCTGCAT 
CTCTAGCCTCTGTGGAAGGAACGGGGTGTTAGAGGAAAGCATCTTCTTTACCTTGGACT 
CCTTGAAACTGCCAGATGGGTACACACCCCGGAGAGGTGACGTGGTCAATGCAGTGGTG 
GTGGAGAGCAGCCAGTCATGCTATGTCTGGAGGGCGCTTTGTATGACCCTAGTGAAGAG 
GCGAGACGCCGCCCCTGTTCATGAGGCCACTCATTTCTATGGAACGATTTTGCTGAAGA 
ACAAAGGTGATATTGAAGTTACACAGGTGACGCATTTTGGAACCCTAAAGGAAGGAAGA 
AGTAAAACCATGGTGATCTGGATAGAGAATAAAGGAGACATTCCTCAAAACTTAGTCAG
CTGTAAACTGGCTGGCTGGGATAJ^ATCTAJVACAAJTTC^ 

ACCAGATGTGCCCCGTGGTATCTTTTGTTTCTGTTCCTGAGAAGGAGAATTCATCAGAT 

GAAAATATTAATTCATTAAATAGCCACACAAAAAACAAAACCTCTCAGATGTCGGAGAG

CAGTTTGGTGAACAACAGAGGAATCTCTCCAGGTGATTGTACCTGTAAAGGAGAAAATG 

GAGAAAAAGACAACATTCTATCAAGGAAGCAGATGACAGAGCCTGAGCCTGGGGGGCTT 

GTCCCTCCAGGGGGAAAJ\ACCTTCATTGTGGTCATCTGTGACGGAAAAAATCCTGGCCG 

CTGCAAGGAGCTCCTTTTGCTTTGTTTTTCCGATTTCCTAATTGGGCGATACCTTGAAG

TAAATGTTATCAGTGGGGAGGAGTCACTAATTGCTGCGCGCGAACCATTTTCTTGGAAA 

AAGCTTAAAAGTTC^CAJ^GCGTTAJ^CATCCGCAAAJ^CTACAGTTGTTGTGACCGCACA 

GAAAAGGAACTCAAGACGACAACTTCCAAGTTTTCTTCCCCAATATCCAATCCCAGATA

GACTTAGAAAATGTGTGGAACAAAAAATTGACATCCTGACTTTCCAGCCATTACTTGCA 

GAGCTTCTGAACATGTCAAATTACAAGGAGAAGTTTTCGACTTTGCTGTGGCTTGAGGA 

GATTTATGCLAGAAATGGAACTGAAAGAGTATAACATGAGCGGGATCATCTTAAGAAGGA 

ATGGGGATCTGCTGGTTCTGGAGGTCCCAGGGT^ 

GCAGGTGATAAACTGATTTTAAAJ^ 

CAGCTACGTGACTGAGA.TTCLATGAAGAAGATGTAACTCTTAAJ^TTAATCCAGAATT^ 

AACAAGCCTATAACTTTGAJ^CCTATGGATGTGGAATTTACATATAATAGGA.CCACAAGC 

AGACGGTGTCACTTTGCA.CTTGAACACGTCATCCACTTAGGTGTAAAAGTGTTGTTTCC 

AGAAGAAATTATTTTACAGTCTCCACAAGTGACGGGAAATTGGAACCATGCACAAGACA 

CCAAAAGCAGTGGACAGTCCACCAGCAA\AAGAATAGGAAAACAATGACGGACCAAGCT 

GAGCATGGAACAGAGGAGAGGCGTGTTGGTGACAAGGACCTGCCGGTGCTGGCACCCTT 

TACTGCAGAGATGAGCGATTGGGTAGATGAAATTCAGACCCCTAAAGCAAGAAAGATGG 

AGTTTTTCAACCCAGTGCTAAATGAAAATCAGA^GTTAGCAGTTAJ\AAGGATTCTGAGT 

GGTGACTGCCGTCCCCTCCCGTATATTCTCTTTGGACCTCCTGGTACTGGAAAGACAGT 

GAC^TAATAGAGGCTGTTTTACAGGTACACTTTGCCTTGCCGGACAGTCGGATTTTAG 

TCTGTGCGCCCTCCAACAGTGCTGCTGACCTCGTGTGTCTGCGGCTGCACGAGAGCAAG 

GTGCTACAGCCGGCCACCATGGTCCGGGTGAACGCCACCTGCAGGTTCGAGGAGATAGT 

TATTGACGCCGTCAAACCGTATTGCAGAGACGGAGAAGACATCTGGAAAGCCTCACGCT

TCCGGATAATCATCACCACATGCAGCAGCTCAGGGCTGTTTTACCAAATAGGAGTGAGA 

GTTGGGCACTTCACTCACGTGTTTGTGGACGAGGCTGGGCAGGCAAGTGAGCCGGAATG 

CCTCATTCCTCTGGGGCTGATGTCGGACATCAGTGGCCAGATCGTGCTGGCAGGAGACC

CCATGCAGCTCGGACCAGTCATTAAGTCCAGACTCGCCATGGCCTATGGGCTGAACGTG 

TCCTTTTTGGAACGGCTGATGTCTCGACCCGCGTACCAGAGGGACGAAAATGCTTTCGG 

TGCTTGTGGCGCACATAJVTCCCCTGTTGGTCACAAAGCTGGTGAAGAACTACCGGTCCC 

ACGAGGCCCTGCTGATGCTGCCCTCACGGCTGTTCTACCACAGGGAACTCGAGGTCTGT 

GCGGACCCCACAGTGGTGACCTCCTTGCTGGGCTGGGAGAAGTTGCCTAAGAAAGGCTT

CCCTCTCATCTTCCATGGTGTGCGGGGCAGCGAGGCACGGGAGGGAAAAAGCCCATCGT 
Figure 59c

GGTTCAACCCGGCCGAGGCCGTCCAGGTCCTGCGCTACTGCTGCCTCCTGGCCCACAGC 
ATCTCCAGTCAGGTGTCTGCCAGCGACATTGGCGTCATCACGCCCTACCGGAAGCAGGT 
GGAGAAA^TCAGAATTCTTTTGCGTAATGTTGATCTGATGGATATAAAGGTTGGATCAG 
TAGAGGAGTTTCAAGGACAAGAGTATCTGGTCATCATCATTTCGACCGTACGGTCAAAT
GAAGATAGATTTGAJIGA.TGATCGATATTTTT 

TAATGTTGC^TCACCAGACCCAAAGCTTTGCTGATAGTGCTGGGAAACCCCCATGTTC 
TCGTTCGAGACCCCTGTTTTGGTGCTTTGCTGGAATACAGTATTACAAACGGTGTTTAC 
ATGGGATGCGATTTACCTCCTGCACTGCAGTCTCTGCAAAACTGTGGCGAGGGGGTGGC 
AGACCCCTCCTACCCAGTGGTGCCAGAATCCACAGGACCAGAGAAGCATCAGGAGCCCA 
GCTGATCTGCAGTGGCTGACAGCAGGGAGGCCATGTGCTCAGCCTGGCCACGTTGCCGT 
TACAGTCTGCTCCGTGGCTCCTGTGGCCTGCCCTTGTCTCGCAGCCAGGCAGGGTCGTG
TGTGGGTGTGGGGCTGCCAGGTTGGACGCAGCTGCTGCTGCCCTGACTTTGGCATATGC 
CAGCCTGTTCCTGCCACAGGGCAGTCACTGCCGCCTACCCTGAAATAAACCCTCGAGTG
ACCCCCAAAAAAAAAA 

Figure 60

SEQ ID NO.: 60 hSPG15 encoded protein sequence

MLSLAAIEjVAFFWRTAJDTPRE 

FSSDAVTSRVLLWGQEVIA.VVEENKVSNGL^ 

VLIGCVTSLVEGAGCISQTTYFSLESVCEGFEPCKGDWVEAEYRIRPGTWSSEATSVK^ 

LRYKRVEKVCISSLCGRNGVLEESIFFTLDSLKLPDGY^ 

WRALCOTliVKRRDAAPVHEATHFYGTILLKNKGDIEVTQ 

NKGDIPQNLVSCKLAGWDKSKQFRFQMLDKDQMCPWSFVSVPEKZNSSDENINSLNSH 
TKNKTSQMSESSLVNl^GISPGDCTCKGENGEKDNILSRKQMTEPEPGGLVPPGGKTFI 
WICDGKNPGRCKZLLLLCFSDFLIGRYLEVWISGEESLIAAREPFSWKKLKSSQALT 
SAKTTVWTAQKRNSRRQLPSFLFQYP 

EKFSTLLWLEEIYAEMELKEYOTISGI^ — - 

QEYttGHAIEYISYVTSIHEEDVTLX 

VIHLGVKVLFPEEIILQSPQVTGNWNHAQ 

GDKDLPVLAPFTAEMSDWVDEIQTPKARKMEFFNPVI^ 

I^GPPGTGXTVTIIEAVTjQVHFALPDSRILVCAPSNSAADLVCLRLKE 

VNATCRFEEIVIDAVTCPYCRTCEDIW^ 

DEAGQ ASEPECL I ? LGL21SD I SGQ I VLAGD PMQ LG P VIKSRLAMAYGLNVS F LERLMSR 
PAYQRDENAFGACGAHNPLLVTKLVKNYRSHEALL^ 

LGWEKLPKKGFPLIFKGVRGSEAREGKSPSWFNPAEAVQVLRYCCLLAKSISSQVSASD 

IGVITPYRKQVTlKIRILLRTTVD 

FLGFLSNSKRFNV1\1TRPKALLIVLGNPHVLVRDPCF 

QSLQNCGEGVADPSYPVVPESTGPEKHQEPS

Figure 61a

SEQ ID NO.: 61 hSPG15 genomic DNA sequence

GGGGGTCACTCGAGGGTTTATTTCAGGGTAGGCGGCAGTGACTGCCCTGTGGCAGGAAC
AGGCTGGCATATGCCAAAGTCAGGGCAGCAGCAGCTGCGTCCAACCTGGCAGCCCCACA 
CCCACACACGACCCTGCCTGGCTGCGAGACAAGGGCAGGCCACAGGAGCCACGGAGCAG
ACTGTAACGGCAACGTGGCCAGGCTGAGCACATGGCCTCCCTGCTGTCAGCCACTGCAG 
ATCAGCTGGGCTCCTGATGCTTCTCTGGTCCTGTGGATTCTGGCACCACTGGGTAGGAG 
GGGTCTGCCACCCCCTCGCCACAGCTGTAGACAGAGGAGAAGCGGATGGCCAGTGAGCC 
AGGCTCCCACAGCGGGCTGGGAGCTGCACTCTTTTCCCAATGTGGGTTTTACAAGGGGA 
CTTAGTTTACCCTGTTAGCCTATGTGGGAJ^AGGTGACATGACCCCAAATGTCCAGGAAA. 
CAGTGGCTGCTGCAGCCCA.GGATGAGGTGAGGACGGTGGCCGGCAGAGGGCTAA.GGCTG 
CAGGTGGGTAAGTGGATGGGGGTGAGGGGAGCAGGGGAGGGCAGGCTAGAGCACGTTCT 
Figure 61b

AGAGCCA.GCCTGTCTTTGAGGAAGACAGCAGAAGCGCTCACTTTTGCAGAGACTGCAGT 
GCAGGAGGTAJ\ATCGCATCCCATGTAAACACCGTTT^ 

ACCAAAACAGGGGTCCTGTTGGCAATAACAATTAGCCTGGTGTGA-GAGCAAGTGAA.GGC 
CCCATGTCTCCTGGTGCCTCGTACGTCTCACCCCGA.GTGACCCGGACCCTCCCTCCTTA 
AGTGATC^CAGCCCACCCAGTGCCTGAAAACTCACTCGAACGAGAACATGGGGGTTTCC 
CAGCACTATCAGCAAJ^J3CTTTGGGTC 

ACAAGAAACCCAAAJ\AATATCGATCATCTTCAAATCTATCTTCATTTGACCGTACCTAA 
AAGCACACAGAGATGAATAAAAATGGCACTTGAAGTTTTTGGGGGGCATGAAGAGAJ^C 
TCTTGATATATAAAJ\ACATTTAAAAATGT^^ 

AAAAAGAAACAGGAGGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCCTTTGGGAGG 

CCGAGGCGGGAGGATCACGAGGTCAGGAGATCGAGACCATCTTGGCTAACATGGTGAAA 

CCCCGTCTCTACTAAAAATACAAAAAAAAAAAAATTAGCTGGGCGTGGTGGTGGGCGCC 

TGTAGTCCCAGCTACTCGAGAGGCTGAGGCAGGAGAATGGCGTGAACCCGGGAGGCAGA 

GCTTGCAGTGAGCCGAGATCGCGCCACTGCACTCCAGCCTGGGCGACAGAGCGACGCCA 

TCTCAAAAAAAAAAAAAGAAAGAAAAAAAGAAACAGGAGGTGGGGGGAGGGGGAGAA^ 

GGGAGGAGGAAGGGCAGCAGAAAGGGGTGGGTCTCATCGGCACTCCATCTGCAGACAGG 

GAGCCCTCACGCTTGCGGCTGTCTGGATGGCGGGGTCTTAGGGGCTCTCCTGCGGGTAC 

CTGCCCTCCCACTCTCTGCTCCTCAGCTGCCTGTTCTTCCAACTTCGTGTCCCTCCTTC 

TCCAAGGCATTCACCATGTCTTCCTGGGTCCCTTCTCTTTACTCTCCTGTGTCTTAGTC 

TTTGAGTATTGTTTTAAAAGACCTTCCC^CAGCCACTGCTTACAATGGCTCCTGGACCT 

AGGGAGTCTCCGCCTGCAGCTGCTTCTCTGGGCTGGGAATCAGCCTCTGCCCCTTTAGA 

TCTGAAGCCCCCAGAACCCCCAGGGCAGCAAGCACCTGACTGTCCATCCCCACGGAGAA 

CAGGGATACCTACGGTCGAAATGATGATGACCAGATACTCTTGTCCTTGAAACTCCTCT 

ACTGATCCAA.CCTTTATATCCATCAGATCAACATTACGCAAAAGAATTCTGAT^ 

CACCTTGAGAJICAGATTAAAAGAAAGACTAAAGATAACAC^ 

TCTTTTCCTTTTTTTTTTTTG^ 

TGGCGTGATCTCGGCTCACTGCAACCTCTGCCTCCCGGGTTCAAGCAATTCTCCTGCCT 
CAGCCTCCTGAGCAGCTGGGATTATGGGTGCATGCCACCACACCCGGGTAATTTTTTGT 
ATTTTTTAGTAGAGAGAGGGTTTCACCATATTGGCCAGGCTGGTCTTGAACTCCTGACC 
TTGTGA.TCCGCCTACCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCACG 
CCCGACATGTACAAATCCATCTCTTTACACATCACAGCA.TAAACCAJUVTATTGGCCAAG 
GTGCAATATTGTTTTTTAATTCTTTATTGCTTTCTTCTAATCTGAATTGATTAGAACAC 
AAATTGC^GTTTTAGTTAAJ\CTGGGGACGGGGGACAATGCTTACTAGGAAAGCCAJ\GTT 
TCATGTGG AAAA.TCC TC C CTCCCCAC AAGC ACAT ATC AAGGATGGAGGAAATC C TTC AG 
GAGAAACGAGATGCATGTCTGATGCTTAGAGCTGAGGATGGCAGCTCTAAGCTGGATGT 
CTCAGCAGGATGGCACCTACAAGCTGAGGTTAAGACACCAGGACACTGGCTGTGTGA.TG 
TCTGGCAACTTACTTGTCTCCATAA.GCCTTCGTAATCTCTGTACCTCTGGGAGAACTAA 
GCACTCACCTCTCGAGTCCCCGAGGTCAGAAAAAAGGAAJ^CATAACATCTTTGACGGGC 
TCTGCGGTCAGCACACGCGCGGTACAGGTGATCTGTTACTGTGATGGCCTAGGGGACTC 
ACCTTT7CTTCTGTATTTTGGGGGTTCTTTC 

CAGGGACACATGCTTCATGAATGAAACCTGGTGTTCGGCA.TGGGAGGCACCTGTGTCCC 
CACCAGAGCTCCAGAGCCACTGTGAACGTGTGGAGATGCTTCCTGGGACCAGCCCCTGG 
GGCCTGGGGACGCTAACTCTCCTATAATCAC^TCACAGAGTCATACGAAATGAGAJ^GCG 
TCCCATTCAAATATACCCAAJVGTGCCCCCTTTGAGAAACACTGGGTTAGAGCTGGGGGT 
TGGCAAATGTTTTCCCAAAGGGCCAAATATC^^ 

AATGTAAAAACTGTTCGAGGCTCTGGCCGTAGAAAAGCTGGCAGCTGCCCGGAACTGGC 
CCATGGATGGCAGTTGGGTGACCTCTCATGTTGAACGA.CATTCAAGTGCGGTCCTGCCC 
CTTTAAAAACCTGCTGCTGCTCACAAGGAGCTTGTTCCTGCCTCTGTGGA.GCTGCAGGA 
GCC\AGGA(X:AaXC

TGCTTCCGGTAGGGCGTGATGACGCCAATGTCGCTGGCAGACACCTGACTGGAGATGCT 
GTGGGCCAGGAGGCAGCAGTAGCGCAGGACCTGGACGGCCTCGGCCGGGTTGAACCACG 
ATGGGCTTTTTCCCTCCCGTGCCTCGCTGCCCTGGGTTGATGAGGAGAGAAJL-.CCTACT 
TTACTTCATGAATATTCACCAGAAGGCTACAATCCAGGAGCTGAAGTCGGTGCTGGGAA 
TAC^CACTGAATATGACAGAATAAATCCCTGCTCTCGTGC^GAACTTACATGTGAGCAG 
GGCAGATGGCATATGTTAAATGATGAAATGGCTCACAGTCATGCTAAACGCTACAGAGG 
AGGACGGGTCACAGGAGTCCGCCACTCAGGTCTGAGCCCCGGAGTGGAGGGCTGGCTGG 
TATTTCTGATCTTTTTTTTTTTTTTTTTTTTGATATTAGACCTTTGATATAGCTACTTC 
TCATCTTGAAAATACTCTTCCTTAGAAAACATTAATGAGATATACTGACATAAAAGACA 
GACATATAGATCAACAGGGTAGAACGGAGAACTTAGAAATAAACCCTCGAATATACGGC 
CAGATGACTTCTGACAAGGGTGCCAGGACAGTTTTTC^.CCAACGCTGCTGGAAAJLACT 
AGAGAGCCACACGGAAAGGATGAAGACGGACCCTC^CCTGACACCACCTGGAAACAGAA 
ACTCAAGGTGGA^CGAAGATTTAAACCCAAGACCTAAATCCA.TAAAATTCTTGGAGGAA 
AACAGGGAAAAAGCCCCACGACATTAGGTTTGGCAATGACTTCTTGGATACGACACCAA 
AGCCACTGGC^ATAACTAAAAGGGATATGCTGGACTACATCCAAATGGAAAATGTCTGG 
GC^TCAACGGACGCAATCAACAACGTGAAAAGATAGTCTGCGGAGGGGGGAGAAAACCT 
TTGCAAATAGTATATAAGAAGTTAATATCCAAAACTACGAGGAAGTCTTAAATTCACTG 
ACAAAAAGCGACCCAATTAAATATGGGCAAAGGACCTGAATTCTCCAAAGGAGACATAC 
AAATGGCCAACAGGTACATGAAAAGCTGCTCAATCT^ 

ATCAAAACTACAGTGAGCTACACCTCACTCCCACTAAGATGGCTACTATAAJIACAACCC 

AGAAAATAAGTGGTGGTGAGGATGTGGGGAAACTGGAACCCTGCGGCACTGTTGGTGGG 

GTGTGGA^TGCTGCAGTTACTGTGGAAAGCAAGATGATGGTTTCTCAAAAAJVTTAAGAC 

TAGAATTACCAACAATTCCACTTCCGGGTATGTACCCATGACGACTGAAAGCAGGGTCT 

CAAATAGATATCTGTACACCCGTTGTAGCAGCACTACTTACAATAGTCAAATGTCTGCA 

GCCCLAkGTGTCCACTGATAGATGAACTGATACGGCAAGT^^ 

AATGCTACTCAGCCCCCAAAiUSGGAGGAAATTCTGAC^ 

CTTGAGGACATGATGCTCAGTGCAATAAGCCGGTCACAGAAGGACAAACACTCCATGAT 
TAATTCCACTTGTATGAAGTACCTAGAACACCTGAATTC\TACAGTCAGAAAGTAGAA.T 
GGTGGCTGCTAGGCACTGCAGGGAAGAAAGTTACTGTTTCATGGGTATAGAJiTTTTAAT 
ACTGCAAGATAAAAGTTCTGGAGACTGCAAAACAATGTGAATATACTTAACACTATTAA 
ACTTTACACTTAAAAATGGAGTCCAGGCGTGGTGGCTCACGCCTGTAATCCCAGCACTT 
TGGGAGGCCGAGGTGGGCAGATCACTTGAGGTCAAGAGTTCGAGACCAGCCTGGCCAAC 
ATGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGGCGTGGTGGGGCATGC 
CTGTAGTCCCAGCTGCTTGGGAGGCTGAGGCATAAGAACTGCTTGAACATGGGAGGCGG 
AGGTTGCAGTGAGCC^AGATC^TGCCACTGTACTCCAGCCTGGGTGACAGAGTGAGACA 
ACTGTCTAAAAAAAAJ\AAAAAGTTTTT^^ 

ATTTATGAATGAATGAATGAATGAATGAATGAGACAGGGTTTCACTCTGTCGCCCAGAC 
TAAAGTGCCAGGGCATGATCACAGTTCACTGCAACCTCAACCTCTCCAGGCTCAGGTGA 
TCCTCCCACCTCAGCCTCCTGAGTAGCTGGGACCACAAGTGTGCACCACCATGACCGGA 
TAATTTTTGTATTTTTTGTAGAGACAGGGTCTTGCTATGTTGTCCTGGCCGTTCTTGAA 
CTCCTGGGCTCAAGCAATCTGCCTGCCTTGGCCTCCCACATCCGGCCCTAGAATTTAAA 
AAAGTGACCACCAGGCGGGGTGCGGTGGCTCACGCCTGTAATCCCAGTACTTTGGGAGG 
CCAAGGCGGGCAGATCATGACGTCAGGAGTTCGAGACCAGCTGGGCCAACATGGTGAAA 

CCCCGTCTCTACTAAAAATACAAA^^ 

C\GATCCTTGGGAGTCTGAAGCATGAGAATTGCTTGAAACTGGGAGGCAGAAGTTGCAG 
TGAGCCA\GATTGCACCACTGCACTCCAGCCAGGGTGACAGAACAJ\AACCCTGTCTCL^A 

AAC^JL\C\GATAAACA?L^^^ 
GAAAATTAGTTGGACGTGGTGGCAGGCGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGG
CACGAGAATCGCTTGAACCTGGAAGGCGGAGGTTGCAGTGAGCCGAGATGGCGCCACTG
CACCCCAGCTGGGGTGAJL^GAGCAAAACTCTGTCTCAAJ^AA^ 
ACAACCCCCAAAAAAGTGACCACCAGATAAAGCAGTAAGAATTAATTTT 
TTAAACCATTAACTTACTCATTTACT^ 

TTAATGGGATGGAAACTTATAGC^GAAACTTCTAAACTGAGAAACAATTCAATGAA 

CAACCAATCAACTAGAAAAAATCAGGAAATAAAATGACTAAGP^ 

CATGTGAAATCACTATAGACTGGGCTGATTTATGTCACTAAAAGCAAACTGACTCCCAA 

AATGGATTAAATATAAAAACCTAGCTGCTCCTAAGAAAT^^ 

GGGCAAAGACATACCAGGTAAAAAGAAAAAAGTGGCAAJTC 

TTTAAGACTAJITACACATGAC^GTATG^^^ 

GTGATAAATTATATGCATGAAACACAGCAGTTAAAATTCGTAAGACAAGAACGA.TCAGA 
AACCCAAGGATAATTTGATGAGGAGCACGACGAGAACGAGGGGCTTCATGACTCTTCTC 
CGCCATGGCCTACCAGGTGGACAGATCCTAGGCCAGGA.C^C^GAGCTGCTGAGTGACA.G 
CCTCC\TGCATTTATTTATATTTATTTATATAGTTTTTAGAGTCAGGGTCTCACTCTGT 
CGCCTGGGCTGGGGTACAGTGGTGC^ATCATAGCTCACTGCAGCCTCLAATCTCCTGGGC. 
TCAAGCAATCCTTCTGCCTCAGCCTCCCAAGTAGCCGGGACCAAAGGCACATGCCACC\ 
TGCTCAGCTAATTTAAAAAAAAAAAAATTTTTTTTTAAGAGATGGGAGTTTCACTATGT 
TGCCCAGGCTAATCTTGAACTCCTAGGCTCAAGCAATCCTCCTGCCTC^GCCTCCCAAA. 
GTGCTGGGGTTACAGGTGAGCCCCTGTGCCTGGCCACTAATGTGTTTTCCTGATACCCA 
GGAAAGCTTCTGAGGATGGGCAGAGCTAGCAGGACAGATGGCGGGAGACCACTGCAGGG 
GAGGGACCTGCCTCAGAGTGCACCATGTCTGGAGGTGTCdGCTCACTGCAGATTCTCC 
TTAGGACCCCTTCCTTTCCAAATGTGAGAACCTCACAGCCGCCATACACA.TATCCCATC 
CCACCACTTTGGACAGATCA.CAGACAGATTTTCTCCCTCAGGAACTTACGAJIGAGCAAC 

TGGGGGCCACTGCTGGGTAGGGCATGGAATG^ 

i^CCT<^ACCCC~TC^ 

CCCGTGTGGCCCGGGTGOVGCCCTCACTGCAGATACCTTGAATGCCCCTTCTTCTGGGG 

CAGGGGGTTAATGCAAAACTTGATTTCTGCTCTAACTAAACTCCTA^TACAAACTA.TCT 

CATTTTAGCGAATGCTTGTAATTTTGCTTGGTCACTCAGCATCTAACTCTGTCCCTCTG 

CGAGGCCTTGGAGAGGGGGTGGTGGCCCGTCACTCCTCAGGCCCCAGGCGCACGGCTCG 

GGATCGCAGCTGCTGCCTTCCCTGCCGGGCTCTCTCCTGACAGCTCCAACCCTCGAGGG 

AAGCACCGGGAGGGAGGCGGTGGTGGCCTCTGGGAGCTTCCTGCGGCCCTGGGGCGTCC 

CCAGCCACATGGCCCTGCGAGCTTCCCCAGGCCCTCCAGAGCTC^CTGGGCAGCTTCGC 

TTCTCTTGGAGCCCAGTGCTTGTCTGCTGTGACCACCTGAGAGTGGTGCCTGTGA.TGCC 

TCCTTGGTGCCAGGCACAACTCATCTGGCAGGAGAAGTCAGCATCCATCTTGGGGGGCA 

CAA.GGCCCCGATGGCTTTTCACTCTCTGGAGACCAGCAGAACATCTGCCCCGCCACAGG 

CAGGTGACAGAGCCTCACCTGGGAGGAAGGACCTCCTGCCATACTGTCCTTACCCGACA 

CGGTCACCTACGGATTCAAGACAaA^CTCACCCGCACACCATGGAAGATGAGAGGGAAG 

CCTTTCTTAGGCAACTTCTCCCAGCCCAGCAAGGAGGTCACCACTGTGGGGTCCGC--CA. 

GACCTCGAGTTCCCTGTGGTAGAACAGCCGTGAGGGCAGCATCAGC^.GGGCCTCGTGGG 

ACCGGTAGTTCTTCACCAGCTTTGTGACCTGCGAGGAGATGGTGATGGGTCACAAGGGG 

AAGGTGGTACCCGGC\C\GTTTCTGTCGAJICGAAJL\GCGGCTCTGCA^ 

GGCCTGCCTGACTTCCAAGCCTGACTCTGCTCCTGCGCCCCTGTGTGTCGCCTGCCCCC 

AGACCAAGCCCTTGGATGATGGTCAAGCC AAGGGTCCATCTGAATAGi* A AATGGAAACA. 

TCAGGAAGAAAAGGGTGTGACGGCTGCCCTCTGTTCCTCTCCTGGGTCTGGGTGCCGCT 

GAGGGGCCTCTTAAAGGGCTCCTGCCACCTGGAACCCAGCAGCCGGGTGGAGGGCAGGT

ggatgcctccctcgtgtgctgagagcttcaccctggaaj^^ 

c^gccctgcctccgcttggccac\gggaaggaagac<:tcctgggcccctgggccaggag 
AGCATAGACACTGTGACTCTGGGGGCCTGGCC^GCCCTGGGCGGGCCTGGCTGTGACTT
CACTGATGCGGCTGAGGGTGACTGTGATATCACC\GTACATGGGACACCTTTACTGTCT 
GCCTCCAACTTGCTCGGGAAGCTCACTCA.GGGCAGGTGCTCTTCCCCTGGTGCTGGTGT 
AAGCAGCCAGAGAGGGTCTCCAGCGACTACAGGA.GTTCAACCAGTCAGAGCAATGCTCA 
GTCTTGACCCTGCTCATCACGTGACCCATGTATTGGCATGAGGACTCGCTGCGTCTGTG 
CCGCGGGGAGCCGTCCTCTCTATGOUITGCTGCACCCTCTGCCCTCCTGTCACTGACTG 
TCCCTCGGGGAGACACTGCTGGGGAGAACAAGCCTAGGGCTCTCCCTCCTTGGGATGAG 
TTAACAAAGCCCCTGTTGGTGAJIAACCTGTGTTCCTGCAGTGCTGTCTGTTACTCGC^ 
GATGACACTGGCACGAGAACGCCATGATTGGCCCCGTGAGTCCTCCCGACTCTGTCAAG 
CCTAAATAGAATGTGTCCA.GCCTCCAACGTGCTACTCCAAACTCCAAATGTGA.TCTAGA 
GTCACGGCACCTCTGCCTTAGCTGGACATGGGCAGCTGCT^ 

TGC^GCCTCCTGGGGTCACTGTGGCAGCCGCTCTGGTTGGAGGGTGCAGGACACTpCCC 

AGGCCCCACACAGGACAJ^GGCCTTCACCTCCCGTGCGGCAGGGTCTTTCTGTACCTGTC 

GCCCTGTGCTGTATGCACCTGCCCTGAAGGCTGTGTGGTGGGGACGGCAGCGTGCTGGT 

CTCTCCTGCTGCGGCCACGTCACCTGCACCCACACCTCCAGCAAAGCAGCTAGCGGGTA 

CAGGCGCACAGCGGCACTGGAGGATGAGGCGAACCTCCGGGGGACAGCAGAGCTCTCAG 

GCGCCCTCTGGGCCCAGCAGCTGCTCACGGCCGGCCATCTTCCT 

GCCTCCTGGCACAGCCGGTCCTGTCTTGCGGGTAGTCCCTCATGAGGCCTTCCTTGGCC 

TGAAGACCCCCACCTGCTCCCTACCCTTGACAACCAAGAATGGGGTCCAGTCCTTTTCC 

CTGAAAGTCTGCCCATGGTAACTGCCCCTCATTTTCTATTGATGTTAATA.CTGGCAATG 

GGAATCAGATGGAGACTGTTTTTCTGTGGGCTACAGGGGCTTACAAACTGAGGCTACTT 

GGTTCACTGTTGAATGATGGGCCCTGGAGGCCCAAGCTC\GTGTGCTGGGGGCTCCAAA 

GATGCAAATCCCAAAGCCCATCCCTCAGAATTTGGTGCAAGGTTGGCTGAGAGCCCA.GA 

CCTGCACTTCAGTGGGACCTTGGGGTCAGAGACCACTGAGGAGACACTGCCTGCAGTCA 

CTCTAGGGCAGGCTTTTTAAGTCACAAAAATTATTTCTTGCTTTTCTTAACGTTAACAG 

AGTAACACAGATTTTAACTTATTTTACTGAAT^ 

CCAGGCTGGACT 

CCCACGGATGGA.GATGACTCAGGTGGACAGAAGAGTGAGCTGCTCGACTGCCCTGGAGC 

CTGTGTCTGCTTAGCTGGGGACCTGAACGCGCTGGAGTCTGTGACTCACCAACAGGGGA 

TTATGTGCGCCACAAGC^CCGAAAGCATTTTCGTCCCTCTGGTACGCGGGTCGAGACAT 

CAGCCGTTCCAAAAAGGACACGTTCAGCCCATAGGCCATGGCGAGTCTGGACTTAATGA 

CTGGGCCGAGCTGCATGGGGTCTCCTGCCAGCACGATCTGCACACAGAGGGCCCGTCTC 

AGCTCATGCAGGGAGGGTGCCGCTGCCACTCCTACGGGCTCAGGTGGGAATGGGGCTGT 

GCGCAGGATGGCCCCATGCTCACTCTTGCTGCAGGGGACGACGGGAGGGACTGGGAGAC 

CAGGGAATGCCGACAGGGCTTGGAGCCGTCTCTCCCCGCTTCCCCCTGAAGTCCTGCCA 

TCTGCCTCCCCGAJ^GTCCTGCCATCTGCCCGGCTTCTGC\GCCTGGTTTTCTCTCCTG 

GAATCTCTCAAAGTGGTAAGAGCCCCAAGGATGCCGTCTATCTTCTGGAAGCTTCTGCA 

TGGCTGGCCCTGGCCGCCATCCCACCTGCCCCTTGCCTCTTGCCCATGGTTGCCTTGAC 

CTGGTCAGGCCGCCCTATGCCGCTGCTCTGCCTGGACCAGTCCCTGCTCCCTCCCTAGG 

CGTCACCTTCATGCTCCGCCCCACAGTGCCTGGTGACTGGGACAGACATCTAGGATGGG 

TGCCAAGGAGGACCCTGAAATGTCCAGTGCTGGACACTGGATGAGCTCGCCTGGATGAG 

CCACAGTGAGAGAACCAGGGAAGTTCAAGGTGATGCCCACGCCACCATCAGAAGCACTC 

GGGGAGGAAGCAGGTGGGAGCAGAGAAC^GAGGATGCACAGAGCTGGAGGCCAGGGCAG 

CCTGAGGGGCAAGTGGGTCAGATGCAGGAGCACTGGCCCTGTCCGCCCCTCCAGGCACT 

GAACGGCCGGCTCA.CCCAGAGCAGGGACCACATTCCTGACTTTACCGCTGTATCCCCAA 

CACCTAGAATGGCTCTTACCTCCGTTTTACAAGTAAATGAGTCAAGAC^TTCAC 

GTCCAAAAGGAATAAAGTACAATAGAGACCCATGGCCCCCAACACCTCGGGGTCCTTGG 

CACTTCCACCTGACCCCTGTACAGCTGTGACTGTTCCCGGCAGGATCGCAGCCTGCATC 
CCCTCAGGAGAACGGGGTGAGGAGGAAGC^GCACACACAGTavj i s. w GGACTTACCTGGC 
CACTGATGTCCGACATCAGCCCCAGAGGAATGAGGCATTCCGGCTCACTTGCCTGCCCA 
GCCTCGTCCACAAACACGTGAGTGAAGTGCCCAACTCTAGAGCAAAGATGTTCTAAATG 
GTAATTTCTGCCGAJITTGTTTTAGCAATTGATGCCTCTGACACACTGGCAAAGCCACGC 
TC^GCACCCCGGGGGCACAGC^GGGTTGCCCAGGGGTGTGCAGGAGCTGGGGGTCTGGG 
TGTCACCCGGCTGGAGGGAGGACAGGCCACAGGTGTCAGGAGATGCTAAGCCL^CATCAG 
CACCCAGTGCTTTGCACGCAGCGTCTGTATTTCTAGAGATGTGGAGGGAAGAAGGGGTG 
GGAGCGATGGGCAACACACGAGCGCCTCCTGGGGGCAGCTCACAGGAGAAGCGCCGCAG 
GCTCCTGGCCTGCTCTCCACCTGCGTCTGACTCCTGTCATTAT 

TAGTCCTGACTTCCCCTCTCTGAGCCTCCACCTCCCACGTGTCTCTCGGGAGGTGGCTG 

AGGTGAGAGCAGCTGAACTGGCCGCTGTAAAGCGCTGAGAACAGGCCTGGCCATGCAGG 

CGGCCACAGATGGCAGCCTCTCCTGCCATCCGCTCATTTCCTCCTGACTCATCCTTGTG 

AGCCACTCTCAGCTGGGCACTGCCTGGCCTGC\TGCCAATAGGCTGAGGCCCTGGGA^ 

CACTTCCCAATCTGCGGCACC^TTAAGAAA^TGGGGTTGGTCGGGCGCAGTGGCTCACG 

CCTGTAATGCCAGCACTTTGGGAGGCCGAGATGGGTGGATCACGAGGTCAGGAGATCAA 

GACCATCCTGGCTAACACGGTGAAACCCCGTCTCTACTC\AAA^^ 

GGGTGTGGTGGCGGGCGCCTGTAGTCCCAGCTACTCGGGAAGCTGAGGCAGGAGAATGG 

CGTGAACCTGGGAGGCGGAGCTTGCAGTGAGCTGAGATCGCGCCACTGCACTCCAGCTT 

GGGTGACAGAGCAJVGACTCC\TCTCATAAAAAAAAAAAJ\AAAAGAA 

GGTCTCTAGAATGGCTCCCC\CAATTCTGGCCCCTGCTGCAAACTC\GTGAGGTTCCCA 

GCTACAGAGCAGCCCTCCTGGCGGGTACCTGTCCGGCTCTTCCTTCTCCGTGTGACAAG 

GGAGGTAAGGGAGGTGCAGGAGCAGAGAGTCCACAGCGTTGCTGATGGACGGGGTGGGG 

ATTTTGCGGGAAGGAGTCGTCAGGAGAAGCTGCGGAACTCTTGCATGGGAAGGCTGTGG 

GACTGAGCGGAGAGCAGGAGCTCGCTCGGCA.CATGATCTGTTTCTAGAGTGCAGAGGGG 

CCACCCCACCTGGGGCCATGAGCTGAGGAACAGGTGCCCCGGCTTCCCTGGCTCGCCCA 

CCCCAGCAGCTTCTGCTCTGTGGTCAGGAGGCCTCAGTGGGGAAATGCTCTTGGGGCAG" 

AAACATGTCCTGATGCCTGAGGAAGCAGGTATGGGGCAGAGAGAGTGTCCGGAGCCACG 

CCAACACTCAACAGGACGCACCTTGGGCCCGCACCGTCCTdCTGCCGAGCCCACGTTT 

ACCCTTCTCACTGCTGATTCCAGGATCTCCCCTCTGCAACCCTAGATGCTGGGCAAATC 

CATCACCTCTGAGCCTCAGGTGTCCCACTGTGGGGTGAGACCCTGTGGTCAGACCTGTG 

CGGGGGCGGGCTCCTGCTTACAGCAGTGCTGGGCCCAAC^TGTCCTGCATCCACTCACG 

GTCAGCTGACACCTTTCACTTC\GGTATCAATGTGTAACTCTCAAGACACGGATTTCAC 

AAGTGACACCGTGAACCCCTCCCAGTGAAAGATGTGGAAATCAGCAAACTGCCATTTCC 

CCATCTCATGTCCCTATTTCTGTTGAAAAAAATTAACATTATGTATCTTTCCTAGAA 

TAAGTATTTAGGTTTCAATTCTGTATTAATTGCAGTTATTTAATGTCACTTTTGCCAGA 

TGACATTGCTACTTCTTTGCCCTTTGAAGTGCCCCTCCCCACTCC\GCCGCCCCCAGTG 

GCCAACTAACGCTGTGGGGCCTCTGCAGCTGGGCTGCAGC\CCCTCCCTGCTTCTGCAC 

AGAGCGCCTCTGAGAGAGCGGCCTGCAGCACCCTCCCTGCTTCTGCACAGAGCGCCTCT 

GAGAGAGCGGCCTGCAGC\CCCTCCCTGCTTCTGCAC\GAGCGCCTCTGAGAGAGCGGC 

CTGC^GCACCCTCCCTGCTTCTGCACAGAGCGCCTCTGAGAGAGCGGCCTGCAGCACCC 

TCCCTGCTTCTGCACAGAGCGCCTCTGAGAGAGCGGCCTGCAGCA.CCCTCCCTGCTTCT 

GCACAGAGCGCCTCTGAGAGAGCGGCCTGCAGCACCCTCCCTGCTTCTGCACAGAGCGC 

CTCTGAGAGAGCGGCCTGCAGC\CCCTCCCTGCTTCTGCAC\GAGCGCCTCTGAGAGAG 

CGGCCTGCAGCACCCTCCCTGCTTCTGC^CAGAGTGCCTCTGAGAGAGCGGCCTGCCCT 

GGCCGGCCACCTCCTCCCACTACCTGAACATCAGCTCCAGGGTGCAGGGTCCATGCAGT 

GCCCGCTCTGCCCCCAAAGAGCCCTCTGCACCCCAGTGAGGACTGGCTCTTCGGCTCTT 

GACCTTTGACTCATCAJVCACACATTAACAAAAAJ\AATCCCTGTCTTG 
CTGTCCAATCCGGTAGCCACACACGGCTCTCGGGCACCGCACAGTGGCTAGTCTGAAAC 
CTGACATGCTCTAAGTGTAAAATAC\TGGTGGATT^TGAAGACTTGAACC\AAAA^^'' , 

GTAAAGTGTCTCATCAATCACGTTCTACTAATCTC^.TGATGAAAGGAC ATC 1 TG T \TGT 

GAGAGAGTTAAACGCGCCGATGAAACCATCCGTGACACGCTGTGAT\GGCGn3GCTGGC 

TGTGCCTTTGCTCCATGTGGAGCTGACGGACCACCTCCTGTGGTC A.CCCTC ACGACCAC 

CGTGACGCTGGACACTGTCC^GGGTGACCTGTCTCC\GGCTGTCAGGGAGGGTCACGAC 

CACTGTGACGCTGGACACTGTTCAGAGCGACCTGTCTCCAGGCCGTCAGGG^GGGTGGG 

GCTTCCCTCAGAGGCCCTGGCTTCTCCCTCTCTCACCCTCAGCCTGTGCCCCCGACACT 

GTCTGGCTCTATGCACTGAGAGGGAGCTGCCTGCTGCCTCCCCGACCCCACCCCCCTGC 

ACTGGGGTCTGAGAGTGCCTTGGTCCTCACTCAGCCTCCCTGGGCTCGAAGCTCTTCCT 

CCCAGCCCCCAGGGGAGGTCCTCCTATGACAGCCGGCACCCCACAATCCATCTCCTCCG 

CCTGTCTCCCCTGAAGAGCCCCCAGCGCCAGGGACACAAATGGCCCCCAGGGTCTGACC 

TTACAGAGCCCAGCCGTTTCCTAGGCAGCCTGCCTCCCCACGCCC\CTCCA?GAGACTC 

ACTCTCCTGCTCAGAAGCCCTTCACTGGCTTCCCGGCTCACCAGAGGTCGTCCAAAGGC 

CCCTCCACAGCCCGCCAGGCCTCCCCCTCACCACCCCTTTCAGGCCGTTCCTGCTC^CG 

AGGCTCTGGCTCTAOVGTCTCCTTTGGGGAGACCTCAGTGCTCCCCCACCGCGGGGCCT 

CTGCATGGGCCAGTCCATCTGCCTGCCCATCCCACTGCTGGCTCCTTCGAGTCC' TV TGGG 

GTCTCGGCTTCCCCAGCTCCATTCAGGCACGGCCCTCCCCAACCAGCCTTTCTCGCCCA 

CAGCCCTGCCTCCCTCCCTGAGCACTGAGTCTCTGGGTCATCCGCCCCTCAGACAGAAG 

CTAGGAGAGGACTCCGACCTCTGCCCTGGGAGGCCCCATGCCGCGCTCCATGCCCGGGG 

CTCACCTCACTCCTATTTGGTAAAACAGCCCTGAGCTGCTGCATGTGGTGATGATTATC 

CGGAAGCGTGAGGCTTTCCAGATGTCTTCTCCGTCTCTGCAATACGGTTTGACGGCGTC 

AATAACTATCTGGGGGCAAGGAGGCGTAGAGTTAGTTTACTCCTCCAATTATTAAGCAG 

AATTCC^AAAGGGAACCTTGCCCTGGTACC^CTTCTTAGTTACGTCTAAGTCATGATTT' 

AAGGTTTTTTTCTCVJUVTATCTCATACTTCCTTATCAATTCAGTCCATGTTTTATCAA 

CTCTAGAGCCACAAAIATAACACATAATCCTGCAGAGGGTTCCCCAAAAC^CTTTGTA 

CATOTTCTATTCAGTTTGTTTGTTTGTTTATTTAGAGAGGGAGTCTTGCTCTTGCTCTG 
TCGCCCAGGCTGGAGTGCAATGGTGCAATCTCGGCTCACTGCAACCTCTGCCTCCCAGG 
TTCAAGTGATTCTCCTGCTTCGGCCTCCGGAGTAGCTGGGATTACAGGCGCATGCCA.ee 
ACGCCC\GCTATTTTCAATGTCTTTTTTAAACGCAATATTTCAGGCTGGGCA.CAGTGGC 
TAAAGCCTGTAATCTC^CACTTTG<3GAGGCAGAGGCGGGTGGATCACTTGAGATCAGG 
AGTTCGTGACCAGCCTGACCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAATT 
AGCTGGATGTGGTGGCACACACCTGTAATCCTAGGTACTTAGGAGGCTGAGGCAGGAGA 
ATTGCTTGAACCCGGGAGGCGGAGGTTGC\GTTAGCCAAGATCA.CACCACTGCACTCCA 

gcctgggcaacaagagcaj^aaactctgtctc\aa^^ 
tcgctctataaccaaaccag<:aacttaaaggaatacttaaaaccaaaccaggccgggca 

CGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCAGATCATGAGG 
TCAGGAGATCGAGACCi.TCCTGGCTAACACGGTGAAACTCCGTCTCTACTAJiJ^AATACA 
AAAAATTAGCCGGGCG^GTGGCGGGTGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGG 
CCAGAGAATGGTGTGAACCTGGGAGGCGGAGCTTGCAGTGAGCCGAGATCATGCCACTG 
CACTTCAGCCTGGGCGACACAGCGAGACTCCGTCTCA^ 

AAAAAAC^CCAAACCAACAAAACAAAACTACTTAAAAAGTAACAAGACGTTTGGGGAT^ 
AAATAAAAGAGGAATGTGATAATTTTA(1\ATGCTAC^ 

ACTC\C\C\GAGTGAGAATGTGATGAAATAGTC\GCAATCA?.GCTCCACCA.TTTAGCGG 
TGAGGGAGGGCCCATGGCAGAGCC^TTTCTCiAACACiGCTTTCTGTGGCGCCTCTGTG 
CCC\CCCTCCTACCCCTGCCCTCCCCTGTCCACTTGGCATTTCCTATGTTCC\GGGAGG 
GGTTCTGGTGCCCCCC\CTGCTTGCACCAr.GGGCTCACCTCCTCGAACCTGC a .GGTGGC 
GTTCACCCGGACCATGGTGGCCGGCTGTAGCACCTTGCTCTCGTGCAGCCGCAGAC\CA 
CGAGGTCAGCAGC\CTGTTGGAGGGCGC\CAGACTAAAATCCGACTGTCCG^^\AGGC\ 
AAGTGTACCTTCATCA^\GACACACAA.GAGACCAAGGCTCAACATCCACACCGCAAGGT 
GGAAACCAGGAGTCTAACCTCAJ^CACACACATTCGGKjTC 

CTCAGATGCCAGCCCCCAACACACTGAGACCTGGGCCACGACTCTGCCTCATCACACTG 
CTGCCAAAGCATGGGTTAGGTGGGAAGGTTCTCCGCACTCTCTAAGATTCTAAACCACC 
TTTC\GGGCAGAAACAGAGTCAGATTCATCTGCACACATCTGGCGTGTGGTACAATTGT 
GCGCTCAATACCTGTTTGCTAATTTGAATCCTGAATGACGCTAAATGTAGATGGTCTTC 
GGCTTATGATGGTTTGACTTAACGCTTTTTTGACTTTATGATGGTGTGAAGACCATCCC 
CAGTCAGTCTGCTCCTCACCTGACAAGGGATCATGCCTTGATGAACCTGAAACAGGCTT 
CCCAAATCCa^TTACGATGGGTTATTGGGTTGTAACCCCATCAAAAGTTAAGAAGCATC 
TGTAGACAATAGTTTCATAAACTTAACTTCAATTAC^AATGAAGATATCTAAAACTTCT 
GTGGAATGGTGGCTGTAGCTACTATAAGAGTAACTTCTTTTCTGCAGCAAAATGTCATG 
ATTCAGATAAAATAGTTTACCAGGCTAAAAATGAGTTTCCTTCTAAACCTATGAAATAA 
GCTTTTAA^CCCTAC^TATCCACAAAAAC^GACTTAA 

TCTCAAGCTCTTTCCAACTGTCTGATTCTACAATCTAA^CTCACACTTCCTGAAGACAC 
AGACGACACATGAACAGCAGCACAGAGCCACAGCACCCGCGGCGCCGCTGTCCTTACCT 
GTAAAAC^GCCTCTATTATTGTCACTGTCTTTCCAGTACCAGGAGGTCCAAAGAGAATA 
TACGGGAGGGGACGGCAGTCACCACTCAGAATCCTTTTAACTGCTAACTTCTGATTTTC 
ATTTAGCACTGGGTTGAAAJ^CTCCATCTTTC 

CTGTATTCAAAGTGAAGAAAGTTTCAGTTAGATGCAAAC\GCTCTCAACAGCTGTGGAC 

ACGTCCTC^CCTAGGTCAGGCGTCAGC\CGTCTGTGCAAAGGCTAJVATCAGAGTCAlCAG 

ACAGGGAGTAGCTCTTCAGTGTGACCAGGGGTAAGATGCGCACAGGGCATTCCTCTTAA 

TAATCTGTTTGCATTTTTTTTTTCTTTTTTGAGACAGGGTCTCGCTCTGTCACCCAGCT 

TGGAGTAC^GTGGCACAATCTCAGCTCACTGTAACCTCC^TCTCCTGAGCTCAAGCGAT 

CCTCCCACGTCAGCCTCCCGGGTAGCTGGGACCAC^GGCGCCCACCACCACACCTGGCT 

AATTTTTGTATTTTTTTTTTTTTATAGAGACAAGGTTTCGCCATGTTGTCCAGGTTGGT 

CTGGAAGTCCTGAGCTC\TATGATCTGCCC\CCTTAGCTTCCCAAAAGTGTTAGGATGA 

CAGGTGTGAGCCCTGCACTCAACCTGTCTGTGCTTTGTTAAAGAGGGTCAGGAGAACGA 

ATGCAGCTGCTGAGAGGAAAATGACAGGCTGTCAACCGATTCTGCGGCTTTAGAGATCA 

CACTCAAATATTAGAGACTGGATTAAAAAATGTCCACATCTGCAGAGTACCTGGAAAAA 

AAC^CCC^GAATCTAAAAGTCTTCTTAAAGTATCTAAGCTAAAACACCAAGT 

TCAGGGTGGAGACGGCAGCCGCCTGTCACAGGCCACACGCCCCCTACTGTTCTTCCTTC 

TGCATGAGAGCCCACTCCTCAAAGTCCCCGTTTTCCAGGCGACAGTCTCAGAGTTGCTC 

TTCTCCTTGCCCCTCATGAGCACATACCCCAATCGCTCATCTCTGCAGTAAAGGGTGCC 

AGOVCCGGCAGGTCCTTGTCACCAACACGCCTCTCCTCTGTTCCATGCTCAGCTTGGTC 

CGTCATTGTTTTCCTATTCTAGAAATTA 

GTGTTTACACTGGCTTGGTTGAAAGAGCAAACGTAAACATCTAGTGTTCTACTTAGTAT 
ATTTATTTAAC\GCTCTTTGGATGGATCAC\GGTCAGACCCTTTGAAAAATAAA 
AACAAACGATC\GATAAAAAA.CTGCTAAAAAAGTTTG^ 
TTTAGAAATAAACTAAAJITT^ 

TAATATCATCTTCCCTTGGTACATCCTACCTGCTTTCCAGGCTGGACAGTGGGGGAACC 
TGCAGGGGACAGTCAAGGGTACAGCCC^GGGCTCTGGGTCTCACGGGCCTGGAGAGTGT 
GCAAAGTGCCAGTGCTGCCTGTTGACGTCAGGCAAGATCCTGCACCAGGCCGGGCACGG 
GGCTC^CGCTTATCATCCCACC\CTGTGGGAGGCCAAGAC\GGTGGATCACTTGAGCCG 
AGGAGTTTAAGACTAGCCTGGGCAACATGGTGAAACCCTGT^ 

AAAAAAAAAAAAATTAGCCGGGCATGGTGGTGCTCACCTGTGGTCCCAGCTACTTGGCA 

GGCTGAGGTGGGAGGATCACTTGAGCCCAGGAGGCGGGGCTGCAGTGAGTGGAGATTGG 

GCGACCAACTCCAACCTGGGTGACAGAGACCCCCGAAJ^CTCATAAAA^ 
GCACGCAGTGGCTCACGCCTATAATCCCAGCACTTTGGGAGGCTGAGGCGGGCAGATCA 
CAAGGTCAAGAGATCGAGACCATCCTGGCCAAC\TGGTGAAA.CCCTGTCTTTACTAAAA 
ATAC^AAAATTAGCTGGCCGTGGTGCCACCAGCTACTCGGGAGGCTGAGGC\GGAGAAT 
AGCTTGAACCCGGGAGGTGGAGGCTGCAGTGAGCCAAGATCGTGCC \CTGC A.CTCC <VGC 

CTGGCGACAGAGCGAGACTCTGTCTCAAAACAAAAAAAAAAAAGGATCCT^C *C ^GA? 
TTGGTTTTCTGTGTGTTCCLAATGTAAGTAGTATTTGTCCTGAACC^ GTGGG? T^rrr aa 
TTTTTTTC ATTATGATCTGTAATTCTTTGTTAAATAACTTC AT^AT^TTC \T AGG£ T^G 
ATTCTGGAATCTATAAAATCAAAAGTTCTGGGGCCAGGGGTAGTGGTOCACACCTA*' r AA 
TCTGAGCACTTCAGGAGGCTGAGGTGGAAGGACTGCTAGAGTCCAGGAGGTCA^GGTTG 

CAGTGAGCCATGATTGCGACACTACACTGCAGTTAGGAGGACAGAGGAAGACTCTGTCT 
CAAAAAAAAAAAGTTTCACGTTAAAAAATTTAG^^^ 

CAGGCTTGTCCAACC\GCGGCCCATGGACTATATAC\GCCTAGGATGGC^CTGAA'PGCA 
ACCCAAC^CAAACTCGTAAACTTTTTAAAACACTATAAGATTTTTTTGTGATaC^^TT 
TTTTTC^GCTCATCAGCTATCATTAGTGTTAGTGAATTTTATGTGTGGCCCVGACAAT 

tcttcttgcaacGtggcctagggaagccaaa\gattggacaccca^ggtc t Waaggtt 

ATGCCTATAACCTCCTCCCACAACCATTGTGTTTTGCAGAGTGTGACTGA.C^A.CAA'^A 
AGGTGCACATATTTAAAGTGTGTGACTTGAC\AGTCTGACGTGCACATACCCATAAAAC 
CATCAGC^CAATCAAGATGACAAATAGACCTGTCAGTCCCCAGAGCCGCCTTGTGCCGC 
AGCCCCTCCAGTGGCACTGGCTGGCTCACATGCTCTCGAACTTCATACTTC^iVTAAAT 
GCAC^CAGTGTGATTCATTGGAGAGCCACCTGTGTTGTTGCATGTC^CAGT^TGC'TCC 
TTGTATTGCTGAGTC\TGTTCCATTGTACGAACAC\ATGC AAC rT 'TGCT rT, a TCC A.TTCAC ' 

CTGCTCACGGACACTGGGTTGTTTCGTTTTTGGTGATTAAAAACCCAGCTGCTGTGAAC 
TTTGTGTATGGGTCCTCACGTGGCCTTATGTCTTCATCTCTCTAGAGAGAA^TGGCTGG 
GTTGTATGGCAGGTGCGCGTCC\GCTTCTTAAAAAC\CCTTTTGCAC\GTGGTGTGCCA 
CTCCCCATTCCCACAGCAGGGTACGTGCCAGCGGCCCCACGGGCTCACCAACGCGCA.GA 



PA.CCATC 

TGTATGTATTCTCTGGTAAAGTGTTCAAGTGTTTGATCCATTATAAAAAA^GAGTTCT 
TAGGCAGTTTGTAGAGTACTTTCATATGTTTTCCGAATACAAGTTCTTGATC^GATGTG 

TTCTTTGTTTTTTTTTGAGACAGAGTTTTTCTATCT 

TGGGCTCAAGAGATCCTCCAGCCTCAGTCTCCTGGGTC^CTGGGACTAAAAGTGTATGC 

CATTGTACCTTACTGTCTAAGAATGTCTTTTCAGCAGCAAAATATTTAATT^CACTGAA 

GTCCAATTTATCC\TTTCTCTGTAAGTC\TGCTCTTAGTGTCACATAAAAAAAAATCCT 

TGCCCAAAC^GAGGACGTGC^GATGTTCTTCTATGCTGTTT'CCT^GAT'GCaTaGTTTTA 

GGCTTGACATTGCGGCTATAAACCGC^GAGTAAGTTAGTTTT^GTCTGCGCTG' T, G a GCT 

ATGGACCCAGGTCCATGTTTTGCGCGTGCCCTGCATTTACAGACCACGTGAGG^AAAAC 

AGGTAACTTGATAATATGGAATCTTCCGGCCCAAGAGATACTAA^ATCTCCCTCCATTG 

ATTTACGTCTTTTGTTGCCCATCTTTTGTCAGATGTATCCCTAATGTTTAA ? ^C a TTT& 

GGTGCTACTTAAAA\AAAAAAAAAATTCAGGCCTGGCATGGTGGCTCACGCCTGT a ATC 

CCAGCACTATGGGAGGCTGAGGTGGGCGGATCATGAGGTCAGGAGTTTGAGiCCiGCC^ 

GATCAACATGGTGAAACCTTGTCTCTACTAAA\ATACAA^^.T^^GCAGGGTG T GG T GG 

CACGCACCTATAATCCCAGCTACTCAGGAGGCCGAGGC^GGAGAACTGCT'T'GA.iTCTGG 

GAGGCGGAGGTTGCAGTGAGCCGAGATCCGCCACTGCACCTCGGCCTGGGCAACAGAGC 

GAGACTGTCTCAAAACAAACAAACAAAACACCAAAA^AC^A^^CTGAG^G^^AC^GG 

^TA^TATACAACTTATTTTTCTAC^CTGATCTTGTATCT^ 

TGAGACAGGGTTTCACTTCi.TCACTC\GGCTGGAGTGTAGTGGTA^ 
TGC^GCCTCAACCTCCCAGGCTCAAGGGATCCTCTTGCCTC a .GCCTCCTGAGTAGCTGG 



WO 01/66752 



57/108 

Figure 61j 



PCT/US01/07371 



GACTAC^GGTGTGTGCCACCATGCCCA^ 

GGGTCTTGCA.TGTTGCCCAGGCTGTTCTCAGTGATTTTCCCATCTC^GCCTCCCAAJVGT 
GCTGGAATTACCGGCACGAGTCACCACACCCGGCCTCC^TCAGACTTTCTACAGGATGC 
GGATGCCTGTTCTTTCTGCTTCATTGCCTGTCTACAATGCACAATGCTGGGCAGA.AATG 
CTGAGGAAACATGC C CTGATC CTGATCTTAC AAGCAAA.GAJ\C^TTC\GTT ATCACTAAG 
TAGGTTTTTGTAGGTAAAAGAAATTTCCTTTTATTTCTA 

GAATGGATGCTGGATTTTGTCAGATGCTTTTTCCAAGTCTACTGCCATGATTATATGGT 
TTTTCCTTTTCAGTTTGTTAATATG^^^ 

ATTCCTGGGATGAACCCTCTTGGTCTCATATACATGTGTCATATACATAGATACATATA 

TGTATACATATAAAAATATGTGTATATAAAATATGTGTGTATATATAAAAATATGTATA 

TTTATATATGTATATATAJIAATATGTATATTTATATATGTATATATATAAAATATG 

ATTTATATATGTATATATATAAJVATATGTATATTTATATATGTATATATATAAAATATA 

TATATGTGTATATACATATGTATTTTTTGAGAC^GAGTCTTGCTCTGTCGCCCAC^C'PG 

GAGCGCAGTGGCATGATCTTGGCTCACTGCAATCTCCACCTCCTGGTTTCAAGCGATTC 

TCCTGCCTCAGCCTCCTGAGTAGCTAGGTCTACAGGCACATGCCACCACACCCAGCTAA 

CGATCTCGGCTCA.CTGCAGCCTCCGCCTCTGTTCCATGTTAGATAGTTTCTATTGCTCC 
GTCTTCAAGTTCACTAATTTTTCCTTCTGOVTTGTCTAGTCTGCTGATAATTCTGTCCA 
ATATATTTTTCA.TTCTAGGCATTGAATTTTTCATTCCTAGAAGATAAAATTGTCTTTTA 
TATATCTTTCGTGTCTCCACTTAACCTGCTCATGTCTTCTACTTTCTTGAACACATGGA 
ACATATTTATAACTGTTTTAATGTCCTTGTCTGCTAATTCCACCATGTTATTTCGGGGG 
GGTTGGTTCCAATTGACTAAATTTTCTC^ 

TGTAACCAGATGTCAGTGTCCATTTTATACCTTGCTGGGTGCTGGATAATTTTATATTC 
CTATAAATATTCTTAAACTTTGTTCTGGGATGTAATCCAATTATTTGGGAACAGCTTGC 
TTCTTTTGAGGCTATCATACTTTGTCAGTCTGGGCCGGAGCAGCCTTCAGTCAAAGGCT 
GGACTTTCTACGCTCCTGAGACAACACCCTCCCATGGACTCTA 
TTTTTCTTCTCTGGCTGTGGGCACAGCTATTTCTGGCACTGTGTC^CTTTTAAGGATT 
GTTCTTGCTCCTCCTTCCAGCTGGTTCTTTTCCAGGCCTGGGTATTTTACTCACATCAT 
GCACTGATCAATGCTCAGCTGAAAACTCA^^ 

CTCTGTCCCCTGCCCTCTCCCCAGOVCTCTACTCTGTCAGCTTTATCCCCCTTGTCCTC 

CCCACCCTTCCAAATGCTGTCTCTGTACTCAGGTCACTGCACTCCGCCTGGGCTCCCTC 

TTGGTGCTCTCTGGCCTAGACACTCTCGGAAGGCTGTGAGTTGGGAAAACCATGAGGTC 

ACCTCATTTGTTTCCTCTCTCAGGAATCCCTGTCCTATGATAGCTGATGTTAATGTCTG 

AAATTGTTGTTCCATATAGTTTGTCTGATTTTTTATTGTTTCAGGAAGGAAGATAAATC 

TGGTCCCTGAJ^TTCTATCTTGGTTGACATATGAAATATTCTTG 

TAGTTAATCCATAAGAAGATAAAGTTTC^GGTTTTAAAACCTACAGTCAATTA.GGA^ 

TGCCCACCTAAATAGTAACACCTCCCTGATCTC^CATGCCATCTGC^CCACAGACTCTC 

CCCGAGAATCAGGCAGTA.GAATGAACCAGTAATGAGGTGATGACGGAGAJ^GAAAAGCAC 

TGTCCAA.GGAJ^GACTCTTTTCTATTTGCAGAAC 

CC^CAGATGACTTTCAAGGATAACCTCTTGCTGCTTCATTTGTCGTATTTTCCATGA.TT 
GTATAGGCAJI^CTAAACCTAJTCATTTTTGCATTTTC 

TAAAACCAAATTTGAATAGCACAAAATAAGTTGTAAGCTCCACCATGACACAGATGATT 

TTCTTGTTACTAGAAACATGAAGATAAATAAAACCACTTCTATTGTATTCACAAAJ^ 

TACTGAGAGCCCAGGACAATACCTGGC\TGTAATAGGCACTATATTACTA.TTTATGGAG 

TGACTGGATTATAGAAGCTTTTTTTTTTTTTTTTTGAGATGGAGTCTCGCTCTATTGCC 

CAGGCTGGAGTGC^GTGGCGTGATCTCGGCTCACTGCAACCTCTGCCTCCTGGGTTCAA 

GTGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTAC\GGTGTACACCATCATGCC 






Figure 61k 

CGGCTAATTTTTGGATTTTTAGTAGAGATGGGGTTTCACCATCTTGGCCAGGCTGGTCT 
TGAACTCCTGATCTCATGATCTGCCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGC 
GTGAGCCACTGCGACCGGCCTAGAAGTTTTTATAAJ\ATATCTTTGCAAC\TTCACAAAG 
TTTGCAAATGTTAAC AATTTGC GGGGTG TGTATC CTTC C AGAC ATTTTCC ATGC AT AC A 
CAATCACATAAATGTACCTAAGTGTACACATAAACCATATAATTTAAAAATAAAACTCT 
AAGGCCGGGTGTGGTGGCTCACGCCTGTAATTCCAGCACTTTGGGAGGCCGAGGAGGGT 
GGATC^CAAGA.TCAGGAGATCGAGACCAGCCTGACCAATATGGTGAAACCCTGTTTCTA 
CTAAAAATACAAAAATTAGCTGGGTGTGATGGCACATGCCTGTAGTCCTAGCTACTCTG 
GAGACTGAGACAGGAGAATCACTTGAATCTGGGAGGCAGAGGTTGCAGTGAGCCAAGAT 
TGC^CCACTGCACTCCAGCCTGGGTGACAGAGCGAGACTCTGTCTCAAAAAAAAAAGAA 
AGAAAAGAAAAAAGAAAAAGAAAAAACCCAACTCTAAGGTCATATAAACATCTGTACTT 
TGCTTCC CCTATTTAATATATTGTAGC CATCTTAGCCCTGCATCTTAGCC CTTGCATAT 
AAACTTTACACTAAATCTATTTTTGAATATTT^^ 

TGAGTTATTTGTGAACCA.CACTGCTCTAATACTCCAGTC AACTAAGCGCTAATA.C TC CG 

AGAAGGAAACGCCCACCCCCTGTGTAGACAGTTACATCGCTGAGCCATCCCCTGTGTAG 

AGACAGTCACGCCGCTGAGCCACCCCCTATGTAAAGACAGCCACGTCGCTGAGCCATCC 

CCTGTGTAGAGAAAGCCAC^TTGCTGAGCACTACCTTTTTGCTGGTGGACTGTCCACTG 

CTTTTGGTGTCTTGTGCATGGTTCCAATTTCCCGTCACTTGTGGA.GACTGTAAJ^.TAAT 

TTCTTCTGGAAACAAC^CTTTTAAAACAAAAAA^ 

AGCTAAAAAAATATTAATAGAGGTTGTTTATGGCTC 

TTCCCTTCCTTTCTTAACCTGCCTGTTATACTGTTTCTAAATTATAGCATTATTA.GTAC 
TGCATATTCTTCACAACAAAGCCCTTTATTTGCCCATC^ 

TTTATGAAGGGAAAAGCGAGCAAGATAAATTTATTTTCGGTTATTTAAGCATGCTATTA 
TTAGTTATTCAC TAAGTTGAAATAGTAAGAAAAC AC TAAAAC TTGATAGAAAATTAAAA 
ACATGAATATAAC AAAAG TTTTAC TTC TAATATT ATG ACTGAAG AG AC TATTTTACACA 
AAATTTTACATGAAGCCTATAAAAATACTCTGTGAAGATGTAACACTATCATTCTATAA 

AAGTAATACCTTTTACACCTAAGTGGATGAGGTGTTCAAGTGCAAAGT^ 

CTTGTGGTC CTGC^GGAAAGAAAACC AC\TAAAATAACTAAC CTTACVTTCAGAAATAT 
TAAGGCTGGCATAAGGTAATGAAA^TTTTATGATACAJVT^ 

ATCACAGCAACAATCCAACAATCAATGAJ^TAGAGAACAGAAAA-ACACTAGAGAGGACG 
GGCGCGGTGGCTCACGTGTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGGTGGATCAC 
CTGAGGTCAGGAGTTCGAGACCAGC CTGGCCAAC ATGATGAAAC CCTGCC TCTACTAAA 
AATACAAAAATTAGCCAGGC ATAGTGGTGGTGC AC GC C TGTAATCC C AGC T AC TC AGGA 
GGCTGAGGCATGAGAATCACTTGAACCTGGGAGGCAGAGGTTGCAGTAAGCTGAGATGG 
CACCACTGCACTCCAGCCTGGGTGACAGAGTGAGA^CTCTGTCTAAAAAAATTTTTGTAA 
AAAOCACTAGAGAATACCAAAGAAGCCAAAAAAAAAAG^^ 

CATTCTTTCTTTCTTTTTTTTTTTTTTTTTAGAGACTCTGCTCTGTTGCCTAGGCTGGA. 
GTGCAGTGGCACAGTCAGAGCTCACTGAAGCCTCAAATTCCTGGGCCCATGTGATCCTC 
CC\CCTCAGCCTCTCAAAGTGCTGGGATTACAGGTGTAAGCCACCACACTTGGCCCCAT 
ATC CATTCTTGATT AAAAAC TCAC TAAAATAAGAATC TATC TC C TC AACC TG AT AGAGG 
GCCACTACGGAACCTACCACCCACGCTGTACTAACTCGCTCAGGACTGACAGCCCTGCC 
C CT AAGAC CAGGGACAA^GCC AGTGC AGC AC TTGCCACC AC T T CTATTCC AC ACTGGCA 
C AC AAGTTC C AAC CAGCGC AGGAAGGCAAGGAAAACAArVT AC ACAGC ATC C AAATAAGA 
GAAGAAGAAGTAAAACTGTCTTTGGTTTTTTTTTTTTOVGACGGAGGCTTGCTCTGTCG 

CCCAGGCTGGAATGAGTAGCGTGATCTCGGCTCACTGGAAC C TC CGCCTC CTGGGTTCA 
AGCAATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGTGACCACTACCACGC 
CCGGCTGATTATTTGTATTTTTAGTAGAGATGGGGTTTCCCCATGTGGGCTAGACTGGT 
CTC^ACTCCTGACCTCAGGTGATCCGCCTGCCTTGGCCTCCCACAGTGCTGGGATTAC 
AGGTGTGAACCACCACACCCAACCTAAAACTGTCATTCTTTGAATCAATGATTATATAT 
GTAGAAAJ^TTCTACAGACTCTAGGAGAAGCTACTGGAGCTATTCAGTTTAGGAGACTTC 
CAGGATACAAGGTAAACATACAAJVAATCAATTGCTTTTTATAAATACT^ 
ATTAGAAATTTAAGTTAJLATTTACTGTTTATAACAGCCATOT 

ATAACTGACAAAAGATCTATAAGACCTATAC\CTGAAAAGTATAAAATATTGCTAAAAG 

AAATGAAAACAGATGCAAATAAATGGAGAGACATACCCTTTTCAAG^ 

ATTATTATTAAGATTTCAATTCTCCAATTCTCCCTTAAACCGACCCATAGGTTCAATGT 

AATCCCAATCAAAATCCCAACAGGCTTTCTGTAGAGACTGACATGACCTAAAATCCATG 

TAAGAAGAAAGACCTGGAACAGCCAGACAACCTTGAAAAGGAACAAAGTTGGAGGCTTG 

ATACTACCTGATTTTGAGACTTATCATAAAGGCACATATCATACACTAGGTTTGTGTTC 

CCCCCAGCTTCGTATGGTGAAACTTCATCCCCAATGTGATGTTACTGGGTATGAGGAGT 

TGAGGCCTTTGGCAAGTGATTAGGTCATGAGGGTGGGGCCTCATGAATGGAATTAGTGC 

CCTTACACAAGGGAGCACAGAGAGCTCTCCCATCCCTTCTGCTGCGTGAGGACACAGGG 

AAAAGACAGCCGCCTACGAACCAGGAAGCAGTCTCACCAGGACACTGAACCTGCTGGTG 

CCTTGACCTTGGGCTTCCTGACCTCCAGAGCATGAGAAATACATTTCTGTTGTTTATAA 

GCCACCCCATCTATGGTGCTATGCTAATGCAGCCTGAGTGTACTAAGACAGACAATAAG 

ACCCTGTAATGTTGGTGTCAAGACAGACAAGTGTACAATGGAACACAACAGAGAGTCTT 

CAGAAACAGAJTCAACAAJVTATTTGGT 

TACAATAGTAJITTCATTGGA.GAAAGGGCAGTTTTTTCATCAAATAATGCTTGGACAACT 
GGATGTTCATTCCACCCTCCAATTTCC^TTCA 

ATGAGATAGATCATAATCCTAAATGTACAGGATAAAGCTGCCAAGTTTCTAAACAGGAT 

AAAATCGTTGAGATCCAJ^.TTAGTTACAGATTTCTGAGATGCATCATTAAAAGATCTGT 

GAAAGAAAACATTCATAJ\ACTGGACTTCACCAJLAATTAATAATTTCTGCATTTC^ 

GCACTATAAAAACAATGAAAATACAAGCTGTAGTCTGAGGAAATATTTGCACAGACACG 

TATCTGATAAAGCACTGTACTGAAACTATA-CAAACATCTCTGCAAACTCAATCATGAGA 

AAJU^TTTTOT 

CAAATGGCAAAAAAGCACATAAAAAGATGTTCAG^^ 

GTAATCCCAGCACTTTGGGAGGCC^AGGTGGGCAGATCACAAGGTCAGGAGATTGAGAC 
CATTCTGGCTAAJZAJIAGTGAAACCCCGTCTCACTAAJVi^ 

AATTAGCCGGGCGTGGTGGTGGGCACCTGTAGTCCCAGCTACTCGGGAGACTGAGGCAG 

GAGAATGGTGTGAACCTGGAAGGCAGAGCTTGCAGTGAGCCGAGATCGTGCCACTGCAC 

TCCAGCCTGGGTAACAGAACGAGACTCCGTCTC^AAAAAAGAAAAAAGAA^ 

CAGCATCATTAGTCATTAGAJVAAATACAAAjTTAAATCCACAACGAGATATCCCTTCACA 

CTAC^GAATCAGCTCTGTTCCAGAGTCAAAGCAGGTCAAGGAGTCCAGGCCTGGACGT 

CGGCTGCAGTCTCTCTAGCTGCCATGATGGAGGCTGTGTGCCTTCTACCAGCGCTGCTG 

CTCCCTGTGGTCCCAGCCAACCCTCACACGTCATCATCCAACACAGCTGCCCCTTCCTT 

CCa^OTCCTGGGTCTTCTCATGGCACTCACATGCTGGGGTCTCTCATCTTGC\CAGCAA 

ACAAAAJIGTACCCCCCTCCTGGTCCTCCTGTATCTCTTTGGTCACCAATTTCTCTCGCT 

CTCTCTCCATCCTGCCCTCCCTGCTTTC^TCGGCAAATGCCTTAAAAAGGAJ^TCAACGA 

CATTTCTCCATTCCCCCGTAGTCCATTCATTCATTCTCAATACATTCCAATCTGGCTGT 

AGCCCAAATACTGCCGATTTCTCAGACAGAC^^ 

TGATAAJVTTTGACTATATTAA^ 

AAAATTTATTTATTTATTTATTTTTTGAGACAGAGTCTCGCTCTGTCTCAGGCTGGAGT 
GCAGTAGTATGATCTCLAGCTCACCGCAA^ 

GCCTCLAGCCTCCCAAGTAGCTGGGACTGCAGGCATGTGCCATCATGCCCA.CTAAGTTTT 
GTATTT^AGTA.GAGA.CAGTGTTTCCCCATGTTGGCCAGGCTGGTCTCGAACTCCTGAX 
CTCAAGAGATCCACCTGCCTTGGCCTCCCAAAGTGTTAGGGATTAC\TCACACCCAGCC 
TCAAAAGATACTTTTTAJUAAGCTAJ^ 
CTAAGAATAGTTTATCATAAGGA^ 

AiUlTTACTCAATAAAA^^ 

ATTTTACGGCCCACGAA.CAAGAAGATTCTCAGC^ 

ATCAAGACCATCATGAGATGCATTACACACCTACCAGACTGGCACCAATTAGGAAGTCG 
AACAJIAATCAGATTACAGAAAGGGC^CCGATCAG^ 

AAAGTCACTGATGCAAC CACTTGTAAAGCTGGAAAAACCC CAAACCCAGCAATCCC ATC 
CCTACATACACAGCAGCTCTAGGAACAGCAGCA^ 

ATGTCCATCAACAG AAAAAC AGC GGGTATTCACAC AACCGC AAAC CAATGAAC AGC AGT 
TAC^CAACACGGAGACATCTC^GGAACACAATGTTTAGTGAAAATCTCGACTCCTAGG 
AACC^AATACAGCAA^GTTACTCCTTTTCTAAA^ 

CATTTTTTGTGCATGACTAGTTAATGGTTTTGTGGAGGAAGAAAAAGCCAGCTGGGTGC 
CAGGTCACAGCACC CCTCATC CGACAAGAAAACAAATAAAAGTAAGGAGCTGGTAGCTG 
ACGGTGACAGTGAGGCATGAACCAAAGATCra^ 

ACATGCVAATGTATGCACCTTTATATCTCAAAAAATAGTTGATAATAAAGC TAGAA ACA 
CGGGGTATGCAAAAATGCTGTAGGAGCCCACTTCCATCCACAAGGCCTATACACGTCAA 
CCACAGGCATGGTGGGGGCTCCTACGTCACTACTTAGATGTCAACIAGACATACTC^G^ 
GAACACTGTCCAATTAAAGATAAAGAGAC^TTAACTGGGCATGGTGCCTCATACCTA 
TACTC CCAAAACTTTGGGAGGCTGACGC AGGAGGATCGCTTGAGCC TAGGAGTTTATGA. 
CCAGCCTGGGACCCCATCTCTATTAAAAAA 

CGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCAAGCGTATCACAAGGTCAGGAGTTC 
AAGACCAGACTGGCCAATATGGTGAAJ^CACTGCCTCTACTAAAAATACAAAAATTAGCC 
AGGCGTGGTGGCAGGCACCTGTAGTCCCAGCTACTTGGGAGGCCGAGGCAGGAGAATCG 
CTTGAACCTGGGAGGCGGAGGTTGCAGTGAGCCGAGATTGCGCCACTGAAATCCAGCCT 
GGGCAACAGAATGAGACCCAGTCTCAJ^AAAAAAAAAAAAAAAAAAAAGACATT 
AAGACTCLACTAAAAGC ATT AC CTATTAT ATGTAAATTCC AO-TC CATAGGTT C A AAGTT 
ATAGGCTTGTTCAAA.TTCTGGATTAATTTTAAGAGTTACATCTTCTTCATGAATCTAJ^ 
A^GGAACAAACAATA^ 

CATCATCAAAAATTCATTCTATAGCTCTGAGCTAJ^TTTGATATCT 

GAAAAGGATTAAATCATTACGCCATTCGATTATACAJVACAAGAGGAAATGGATCACTAC 

ACACTCATACTTACTGAACTTCTATTCACTTC 

ATAGAGAATGTGTGGGAGGTC\CTGACTCCATAGGACACLAAACAAAAC^ 

TAGCCAAGTGTTACCTACTGTAAGGAAGCCTCGATAAGAAGGTCTGCTGGGTGCGGTGG 

CTCACACCTGTAATCCCAGCACTTTGGGAGGCCAAGGTGGGCGGATCACCTTGAGGTCA 

GGAGTTTG AGACC A.GC C TGGCCAAC GGGGC GAAACCATCTCC ACTAAAAAT AGAAAAAG 

GA.CCCAGGCGTGGTGCTGGGCACCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAG 

AATCTCTTGAACCTGGGAAGCGGAGGTTGCAGTGAGCCAAGATCGTACCACTGTACTCC 

AGGCTGGGTGACAGAGTGAGAATCCLATCTCAAAAAAAAAAACAA^ 

CAAGACTTATCCTTTTAGTAGAAJ^ 

TTC TGCA-TATTC TTTC CTAGACAGTATGGGTAGAAAAATAAG AAGG AAAGCCC TGATCT 
GAAAAGGAGCTCAAGCTC TTCTCCAGGTGACTOVGGC AAC AC C CTGGTAGGGTG AGTTT 
CCTGAGGGTCAGGGCTGACACCAGCAGAGAGTCGGGAAGCTACGCCCTGGGATGGCTTC 
CAGCTC C GGGA.GGAAGGGAAGCAACAGTGGTACAGGGC CATGGGATTGTTTC CC CTGAG 
GATGAGGAAGA.CLAGACCAJ\GTCCTGACTATACACAGGAGACATGCGTTAGTCTACTTGA 
GAACAGGTGCAGGTGGCATCCAAGAAAGCCTGAVAGA^ 

AAAAGGAT ACGG AAGGTCATTTCAGATGCC AGGGTTTGATC AATGGCATGGAGGC CC GG 
C CATATGGCTCA.GGAACCAGGGGTGGAGGCAGGCAGGTGC CTTCCTCCTGAGATCTCTG 
GCTCTGCTTCTTTGGAGGTTTTGGTATGCAAAGCTTCCTTTCTTGGCACAGCTGGTCTA 
GGAGGCTAAGATGGTCATTCGGCCATCTTGGGCTCCTCATGTCAACTTGGTCAACAGCA 
AAGGTGGGTAATGGAGAGCTGGCGAGGTCATGTCACTCGTGTCTC\C\TCTCAGTCCAC 

AGGCGATGGCCTAATTC^TTACAAAATCTCTGAAATCCTGCCAAGATGTTGTCTTCTGG 

GAAATATCCCACTCTGTGAAATCAGCCACVGCATGGACCCTGC^TGA^GGTGTAGCGCA 

GAGTGACTGTGGCATCAAAGGTCGGPAGCTGCCTCTTCrCCAGATCCTTATC^GGCTCT 

TCTAGTTCTCTCAGCCCTTCTTCCTTCTGACAGTAAAGGCTCTGGCATTCTCCCTTCA^ 

GTTTTCTACATTAAGTAATGTGTGGCTTCA'TCTTCTCTGATGCACAGATAGAGCACCTC 

CCTTCTCATGCAGATCCC^GTATCACTCCCAGTGCTATGACCL\GTACATGACCAGTAAA 

GTCCAGCCCAAGAGTCACAGACCACCCCATGCAGAACCATCTGGAGCACCAGGATCTAC 

TCAAAGCCAAAAAAACCACCTTCACGCAAACATCAGAGCTGCAi^ 

TTTC CTTTCAAGGC C TCCAC TCTGC GAGCTCTTC TGGAATAC ATTCAGAATGCC C AAAT 

ACAACTCA^CGCAAACCTCATGACTTACATATAACCTAAAA.^ 

GGTTGAAAATTATAGTATTAAGTTTCCTTTA^ 

GTATCTTGACATTTTCCAAAATAGCCAATGCAT^^ 

GGGAGAACGTTAC CTTTC C AACCGGTCAGAAGGGTGTGT CTAACAC AAAC TC ATT AAG A ' 
GAGAGTGCTCTCACCTCAGTCACGTAGCTGATGTATTCGATGGCATGTCCATTGTACTC • 
TOGAGTTTTTAAAATCAGTTTATCACCTAGOTAAA 

AACACAAAATCACAAATTTCTGTGCTGTC CTATAAAATC CAACACTGAGGAGGTTCAAT 
AAAACGTTACTGGGGTCAGGCTGGGTGCAGTGGCTCATGCCTGTAATCCCAGCACTTTG 
AGAGGCCGAGGCGGGCGGATCACAAGGTCAGGGGATCGAGACCATCCTGGCTAACACGG 
CGAAACCTCATCTCTACTAAAAATACAAAAAAAAAA^^^ 

GCCTGTAGTTCCAGCTACTCGGGAGGCTAAGGCAGGAGAACGGCGTGAACCCGGGAGGC 
AGAGGTTGCAATGAGCTGAGATTGCGCCACTGCACTCCAGCCTGGGCGACACAGCGAGA 
CTCCACCTCAAAAAACAAACAAACAAACAAACAAAAACATTACTGGGGTCA^ 
CAACTAATGAAATACAACAAAGAAACATGCTTGATGTACATCGGTTGCTGATTTTTAGT 
GAAATTCTAAC C CLAAAAGAAAGAAAATGC GATAATGTG ACTGTCATATTGC CAGTT AAA 
GAGTATCTTC TC TC AAGTATGGTTTCTC TGC TAGTCC AC C CAGGGAAGCATATGAGTAA 

... CAA\CACACCTGCGTAGAGAGAAGGCCTCCCCT 

AGCAGATCCCCATTCCTTCTTAAGATGATCCCGCTCATGTTATACTCTTTC^GTTCCAT 

TTCTGCATAAATCTCCTCAAGCCACAGCAAAGTCGAAAACTTCTCCTTGTAATTTG^ 

TGTTCAGAAGCTGGAAAAC^CCCAAA.TGTTATAJU^GTC 

TGGAAAGTAGCAGCTCCGGAGCAGGAJ\AGTTCCAACA.GCTCCAGTTCAGCAGATGGCCC 

CCCAGGTGGGGC GATCCTCGCCGGAGTCTGCAGGTGCCTCAGTAAGTTC CCCGGCCTTC 

AGC^CGAGGGCCATGGGGCTCC^GCACACTCCCTGGCCTTTGGCATGAGGCAGTGGGGC 

TCCAGCTGGCGGTGTCCCTCCCTGCACTGCCC^CTCCTGTCCACCCTCCCTGCTCTTTT 

TCAGGTTGTC CACGTTAATTCTGGGAAGGCVGAATTTGGCTTCCACAGAC ATC TGTGAA 

AAAAGGACAAAC T TC C AGGTCTGGCC AAGGTGGAC T AACAAGG AC CAG ATTT AC C CTTC 

TACCTCAAAAAAACTTAAAGAGAC^GAGAAATAJVGAAACGATGGTGTTTAAACAATGA^ 

CATCACGTAGCAAAGGATTGTGATCCCAAGACTGGGGAAACAATCC^ 

GACTGATCCAC C TG TGC CTGAACAGAGC TC CTAGGC C C CAGC ACAGAGG AC C C AGGGAT 

GCTGCATGGTTTCTAAGGAGAGGAGCTCTGATCTGTGGGGCTGAGCGCTGGGAAGAGCA 

AGC AGCACAG A^TGGTCTGGAGA.GGGTC A.GGGGAC C CACTCAGG ACCATCAGC TC AGC A 

CTAAGGAATGCATGAGAC\CC\TCCTGCAAAACCCATTGA.GGGGCCAGACCAGGGCCTC 

AGCATGAGGTGGGGGAAAACTCACAGCAGCCTGCCCGGAATTGGAAAGTGGGA.GGGCAG 

AAAGTGCGTC TGAAC A.CATGGTGGCTGAAAATTTCC AAAT CTGAAGAAAACTC AA-GGC C 

GCATATTCAAGAATCCCAATGAACCCCAAGCACAAG^ 

CACCACACATCGTAAACCAAATCTATTAATATC^GTGA^ 

CAGCTAAGAAAAAACAACGTTATGTGC*AJ^J^ 

ACCCCATAAGAAAAA-CTAAAAAATACTTAGGAGATGAJVTAAAAACAA^ 
GCAAGACTTATGGGAGC\GCACAAGCAGTGCTGAGGGAATTTGTGACTATGAGCACTGA 
TTCCCATGAGCACTGCAAGGAATTAGACAJA^ 

GAAGGAAGGAJ^TAJVAGATTAGAGKjAGA-GATGAJVCAAAAGAGAGAACTC 

CAGAJ^CCAGGAAJLAC^J^ 

TTTAGTAGATTAACTAGAJ^J^^ 

GG AACATTACTACAGAATCTAJIGGAAJ\TAA^ 

TACCAATACCCAGA.TGAJ^.TGGACAAATTCCTAGAAACACAJL^ 

CCATGAAGAAAGAJ^JTCTGAJITAGACC^^^ 

AATATTCTGAAAAGGAAAACCCCAGGACCTGATGGCTCCATAGCTAAATTCTACCAAAC 
ATTTGAAGACTAACTAATACCAATCTTTCTCAAACTTTTCCAAAAAATTCAAGAGGAGG 
GAATACTCTCTGACTTATTCTTATGAGGCCAGCATTACCCTGATCCCAAAGCCAGACAG 
AGACACTTCAGGAAAAGAAAACTACAGACCAATATGAACACTGATGAAAAAATCCTCAA 
CACGATACTAGCACACAAAATTCAGCAGCATATGAAAAGGATTAGCTGGGTGTGGTGGC 
TCATTCCTGTAGTCCCAGTTACTTGGGAGACTGAGGTGGGAGGATTGCTTGAGCCCAGG 
AGCTCGA.GGCTGCA.GTGAGCAGTGACTGAGCCATTGCACTCCAGCCTGGGCAACACAGT 
GAGACCCTGTCTCAAAAAACATATATAAAATAJ^ 

AATTTATTCCTGGAATGCAAGGATAGTTCAACATTTGAAAATCAA.TTAC 
CACATTAACAGAAGGAJ^AAAAAATCATATGATCATCTCAATACAGAAAAAGCAATTGAA 

CAATGGTGCGATCTCAGCTCACCGCAACCTCTGCCTCTTGGGCTCAAGCGCCTCCCAGG 
TAGCTGGGATTACAGGTGCATGCCATCACACCCAGCTAGCATCTGACAAAATTTAACAC 
CCTTTCATGATAACGTTTAACAAACTAGGAAGAGAAGAAAATCAGGCTGGGTGTGGTGG 
CTC^CACCCATACTTGCCTGTAATTCCAACACTTTGGGAGGCCAAGGCGGGAGGATTGT 
AAGACCCTGTCTCTATGATATACAAAAGTTAGCCGGGCATGGTGGTGTGCCGCTGCAGT 
CACAGCTGTGAGCTGTGATTGCGCCACTGCACTCCAGCCTGGGCGACAGGAGAGCCTGT 
CTCAAAAATAAATAAATAAATAAATAAAT^ 
AAAGAAAGAAAGAAAGAA^GAAAACCATCACCTCAAAATGATGAAAGTCA.TATATGA^ 
AACCCACTGTTAACATCATACTCLAATGGTGAAAGATTGAAAGCTTTTCCCATAAGATCA 
GGAACTTCCAAGGGAAGGATGCCTGCTTTCACCACTGCTATTCATCATGGTACTAGAAG 
TTCTAGCCAGAGCAATTAGGCAAGAAAAGGAAATC^ 

GAAATAAAATTATCTGTTTGCAGATGGCATGCATAATTTTATATGTAGAAJ^XTCTAAA 

AGATTCCACAAAAAACTGTTAGAATAAA.TAAATTCAGCAAAGTAGCAGGGTACA 

AAAGGACAAAAATTATCTTCATTTCT.^^ 

GAGAGCAATGTATTTACAA.CAGCAGCAAAAATAATAAA^ 

CAAAGAAGTGAJVAAACATATACAGAGAAAATGACA^ 

AAGACATAAATAAATGTTAACACATTCCATGTTCATAGCCTGGAGGATTCAATATTGTT 

AAAATGTCCATACTAACCAACGCAATCTACAGATTCAJVTGCAATTCCCATCAAA 

AATGACAGGCCGGGTGCAGTGGCTCACACCTGTAATCCCAAGCACTTTGGGAGGCTGAG 

GCGGGAGGAATACTTGAGGCCAGGAGTTTGAGACCAGCCTGGGCAATGTA.GTGAGA.CCT 

CGTCTCTGGGGAJ^AJLAAAATTCCAATGCTA-TATTTTGCA 

AAAATTCGTATAAAJU2ATCTTGAAAA 

GGAGGACTCAAATTTCCTGATTTCAAAACT^ 

AGTCGGTGGCCTGTGGAAGGAACTGAGGAGCACCCTCCAAGCCCAGCAGA.CCTGGGTCC 

CCAGGGAGGAGCTGAGGAGCACCTCTCCATCCTAGCAGACCCLAGGTCCAGCTTTTTC 

CCACCTCGATGAJICCATTCAGGCATTTGCCTACAAACTCAAAGTGACTGGGGGACAGAG 

GACTGCGCTCGCGACTCATGGCAACCTCCCTGCCCAGGATCTGTGAGTAA-AACACATCT 

GAACTTGTTTCTATCACGGCAGTGGATTGAATTTGCACCTTCCATCCTAAGAACCCAGC 

ACTGCCCAGGCCGGGTTTTCCCTGGCATTGGGGAACATAAGTTTGGGCTCCCAGTGCCA 
GATGATCAGGCAGGCATJ^.CCTGGATACAGGTC^GACGAGAGCCACTGGGGCGTCTGCC 
AGAATAAGTTTCCTGCGTGAGGAACCCCGGTCATGGGGCATCAGCTGTCCTCCTGGTAA 
AACAAGGACATTTTTTTAJLACAAGGAGGTGTATTTGGAAAGGATCCCCTGAAGGGCGCA 
TGGTGAAC\CCTAGGTCCCATTCCCTTCATTCTCCTTAGGACAGGGCTGCCAGCTGCTC 
TGGCACTGGAACTCCAGTTTAGCTGGGGACTCTCAGAACACCTCAAACCCCTACAGAAA 
AAAACCCTCCTGGACAAAGGGCAATGTTCTCCCCTCAAA^ 

TCTTCCTTCTTGAACTTTTCTTCTCCACCTACCCCACTCAGAACGTGCTCCTCTGTCTC 
CCATGTGGGTGGGGACCTGGGCTCTCCTCCCCTGGACCCTTGCCCTGCTGGCTCTGTCA 
GGATGATGGTAAAAGGCTAGAJIC\CTTCTTGCTC\TAACAAAGGCCTCCTAGTCTAGCG 
AGAGAGGCCAGGCTGATGGC^GAGGTATCCCTCAGTTCTTCTGCTCCAGACCCCAAAGG 
CCCTGTGCGTGACCCCCAGTGAGGCTGTGGGATGCCAGCCCCTGGTGTACAGCAGGTCT 
GGCAGCGTGAGTGCTATGGTGCCTC^TCCCTCACGGCCAGGCAGTGACCCTCGTTATCC 
CAGCTACCTGTGAAGAAA^CAAGCC^GATGAAGGTCTGAACCTGAGAAGGAACCTCCTO 
GCTTTGTCAGAAAACAAGAGCATTAGAAGA^^ 

TCCTACCTCTGCAAGTAATGGCTGGAAAGTC^GGATGTCAATTTTTTGTTCCACACATT 
TTCTAAGTCTATCTGGGATTGGATATTGGGGAAGAAAACTTGGAAGTTGTCGTCTTGAG 
TTCCTTAAAAAATTAAGTTCACAAGTTAAAAAT^^ 
ACATTCTTACCTAAJ^CATCCTT^ 

AGAAGAACTGATATATGAATAGCTGGTCTAAGGCAGTGGTTTGAAGCTTTCGTTTTAGC 
TCTGGGATCCTTTCCTCAAACAGTTCCCACAAGTGCTCTACTATTTGTAAACAGGTAAG 
TGTAGCTGCACAGGCTGTGGGAAAAC^GGGCTGGAACCCCCAGAATCTCCTGGAACCAG 
GAGTTTGGAAACAGCTGTTTTTAAC^TTCTATGAGCTAAGCTCTTCCTACTATATTTGA 
CATCAGGACTCCACCTGGGCCTATCCCTGCAAAGCACACATTAGGAAAAGGCTGCTCTG 
TCTGGGTCCTGCCTTCGATTTGTTTTTGTTTTGTTTCCCAGCCTGTGGTTTCTCTGGGG 
AAATTCAGTTTGTTCAGGATGAAGAACATCGCTGCTCTGGAGCCAAGCCCAATGCGGCA 
ACAGAAACAATGGGAAAGGGAC^GACAGGAGAGGAGAAGCCAGGAGAGAACAGGGAJ^AC 
GAAAAGTCAGAGTGAGGTCGATAAAACTCAACTG^ 

TTTATCCCTCATTCCCTCCCAGCTCTCCATTTCAGTGCAGTCTCACCCAAGAGAGAAGC 
CCACATTGCTCTCCGTATTCCATTCACTGTC\TCTCAATTTGGGTTCCACTGGCTCAGC 
CCTCAACAGGAGAAACGGGGTGAAGAC\TTCCCCATGGTTTGTGCTGAGGGTTATGTCA 
GCGCTCACATGCGAGGTGTATTTGGAAAGCATTGATTA 

GAAACCAGGATGATCACTACAGGCTTAGGTCAAC^GCCTGAACAGCTAACAGAACCATG 

TGGCCTCAACAGACACGCTGGATTAACAAAATGTGGTTCCTACCATGATGGGTGGCTAG 

AAACTTGGCTGGGAGTGTGCTGGCCTACAAAGCCCCCCAGAACAGATCTGCTAAGTGAA 

CACTCTGCTCTAGCTAAGAGCCAAAGTTGCCTGGCACTGGGGGTCCTAGCAGGAAGTGC 

AGGACCTGTGACAGGCGGCCACGGATGGACAGCTCTGGAAGCAGGCACCCGCCAATGCC 

CGCGTGGGCAGGTCCAGACACCTGGTGCTCCAGGTGTGCTCGAGCCAACCCGGGTCAGG 

CACTCTGGGTCACTAACCTGAGTGGC^GGAAACTGATGTCCTTGTTTTAGATACGCTCC 

CACACAJ\TATGTTAATATTCACGGGGAAGTAACTGACCAGGGGGTTAAAATCCTT 

CTGTTGCTCTGTGTATTTCACTCTTTCTGTTAAGCCAAATGATAACTAGAJLAGCTGCTT 

GGTGTGTCACTAGA-TGA.GAJ^CCAGGGATCCTCCAAGGCTCCTGTC\TTAAAACTCGATC 

CTCGGGGACTCTGCTCTGCCTGGCAGCCGAGGGACTTTCCTGCTCCTTTTCTGGGTTAC 

ATCCCTACAGCCCCTTGATGTTGAAACTGTGCCCCAAAGAGTTAAAGAAACCGATGACT 

AACAGAAATTCTTGAGCCTGCAGGATGGGTGATAAGAAACAACTCCAGCGCTGAGTCTC 

CCCCTGCTTATGACATCAJ^GGACTGGCTGAGAT^^^ 

GAGTTTGTGCAGAGCTTACTGACGTCAC\GCCTGGATTCCCACCGTGTTTCATGCGAAC 

tccctccgaactggcacatgtgacccatgaggtagcatgaaggggtaactatgcacacc 
caaggcctttccagacctcccttttcctt6c\ccaaccacctattaatcccagattcta 
CCCACTAAACGTTTTCTGAC^A^TTACTGCCTTAAAGCCAGCACAGAGAGACACATTT 

GAGCTTGACTCCTGTCTCCTTGGGGGTTGGTTTTCAATACAAAGCTTTTCTTTTCTCAG 

AAACCCACTGTC GT AGCAATGGC C C C TAGTGCATC AGGCAGTGAGC C CC TTTTTCTCAA 

TAATAATATGC C CAGGG ATC ATGAC TGCTTATTCGCTTAC TGAGTGATTGC TTTGTGCC 

AGAAAACTGCACATTTTACATGTCTGACCTC^TTCAATCTTCAAACC\CACGAGGCAGG 

GACTATCATTTTAC CAATGAG AAC AAGGC TC^GAGTGGTTAAGAAACTTGC CCAAGGTC 

ACAC^GCTTCTTGGTGGTGGTGCTGTGATTCAAACAATGGTCTAACCTTAAAAGGTAAA 

CAACCACATGA.TATTCTTCCCTGGTAACAGGTTTTCCCTTTAGTCTGCAA 

AGTC CATAATC CCTCATC TTGAJ^C TGTAACTGGGGCAAC AC GTGATTTGAAATTCAGAA 

TCTTTCAGTATTAGAAAAGTGACCCACCTCACCCTCAGGGAGGTCTGGGGTA.GTAGCAG 

AGCTCAAACCCATCAGTTATTA.TAGCCGATGGAGGAAAAGTCACATCACACGGGACTAA 

GTCATTACACTTTAAAAAGCCTGCTATTC^GGGCTTTTTCGTTTTATAACTACAGAAGA. 

GAACACGTGGATGTGAA.GTGCTTTCCTGAGACTACCCTCCAGTT>J\AA 

CCTATGTTCCCCAGCCACAAGTCCCAGATCACACACTGCTTCTCCTAACTCCATTCTTA 

AGTGGTACCACATCTTCAAACAGGCTTCAGGTGACCCCACAATCCTGTTGCCCCCAGTG

CGAATGACACGGGGATGCCGTGCACTACACACCATGGCCTCGTCAGGACGGGAGAGGTG 

GC^GGAGCTGGGTGAGGCCAGCCTGCTCTGAGAGCCACCTTGGAACTGCCAGAGCAGAG 

TGGGGCTTGGGGGAGTGAAGGCCGTCTTCCTTGGGCTTCACGCTGCTGCTGGGCAGCTG 

CAGAGAC AG AACTTG AC CTTC A.GAGC TCCGTGGAAATGC ACTCCATCC CC C AAAT AC AA 

TCATGGAJ^GAGGCTTGAACCCCAGCTTC^TCCCCTTTATAC CTCCCCAGC CAGGTGGT 

AGCCACCGCTTGCCAGTGCACGTCCGCTTTTCCTTCACAGATAAAACTGCCAGACCAAG 

AGC CACACTCATTTC\G TTATGGTACCTTTTCTG TGC GG TCACAACAAC TGTAGTTTTT ■ 

GC GGATGTTAAC GKITTGTGAACTTTTAAGCTTTTTCC AAGA G CGCAGC 

AATTAGTGACTCCTCCCCACTGATAACATTTACTTCAAGGTATCGCCCAJ^TTAGGAAAT 

CX2GAAAAAC\GAGCAAAAGGAC£TCCTTGCAGCGGCCAGGAT 

ATGAAATTTAGTTTTAGTCATAAATAAGATAGGCAGTCAACACAATTTTCCTTTGTACA 
TACAAGAAAJIGTGACAGAGTTAAGACTCATCT
GAAAAAAGAGATCTTCTAAAATAAGCTATGTAATTAATATTT^ 

GCACCTACAGGCCCGGTGCGGTGGCTCAGGCCTGTAATCCCAGCACTTTGGAAGACTGA 
GGCGGGCAGATCGCTTGAGCCCAGGAGTTTGAGACCAGCCTGGGCAATATAGGATGAAC
CTGTCTCTGCAAAAAJVTACACAAAT^^ 

GCTACTCGGGAGGCTGAGGTGGGAGCCTAGGGAGGTGGAGGCTGCAGTGAGCTGTGATC 
ATGCCTCTGC AC TC CAGCC TGGGC AAC AGAGTGAGACC CTGTCTC AAAAAAAJ^AAAAAA 
AAJIAAAAJLAGTAAGCATCTACAATAGTCTTTT^ 

GAAAAAATTTTGATATTCCCAATTTTAAGTCAGAATTTTATGACATGACCAAAAATAGG
TACTATATCTAATGTGCTTTCTGCCCAGTCCTCGCTGTGGCTCTAACTGGGGCTACATC
ATCCA'CAXTACACTTTGTGCCCCCCTATGGCTTTTTTTTTTTTTTTGAGACGGAGTCTC 
GCTCTGTCACCCAGCCTGGAGTGCGGTGGCGTGATCTCGGCTCACTGCAAGCTCCGCCT
CCCAGGTTCACGC C ATTC TCCTGCCTCAGC CTCCTG AGTAGC TGGGA.CTAC AGGTGC C C 
GCCAC CAC GC CCAG CT AJ\TTTTTTTT ATTTTTAGTAG AG AC AGGGTTTCACTGTGTTAG 
CACCTATGGCTTTTTTGAAAAJVACCTGTTTCTATGTTGTCATAATGTCA 
AAAAGCATTGAAGCTGGCTGCACCTCACACAGCTGACTGTGCACTTGCCACCCACAGCT
TGATGTTGATGAGCTGTGATTTAJITCAGATTTAAGT^ 

CACCCAGCTGGAJIGGCTATCACGOXTCTCGGGCAGCTTAGGCAGGGA.TGCAGGGACAC 
AGGGAGCTGGAGGACAGGGCCTGTTTGGTCAGCTGATGTAACCAACCACACCAGGGAGA
GCGAAGC^CTCA^GTTX3GCGTCTGC<:ATCCCAGCTAGGAAGGGGCCCTTCCTGCCCCT 
CCCTC\GCACACTGTGCTGTCCGTGCCTTGGCCAAATGCCCAGGTGACGGGA.TGGAGAT 
GACCGAGGCCCCAGGGCTTGGCTATGCGGAACACGGGCAAGGCAGTGAAGAGGAGCCAA
CGTGATGTCTCAQAACAGCAGAAGC^CAGGGAGCTCAGGCCTCAGAGAGCCTCAGAGAC
ACCCTGTTTTCTTTGTGAGGGTGAGA-GGCCATCTGGTGAGGACCCTCATGAGTCGTGC^ 
AAGTTAC AGGATC CATTGC C CC AGGTGGGTC AC ACTTGGTAC CC AGCTTC T AG AC C AA? 
GCTTTTTCTTAGGATTCTGTCACATCAAAAACGTCAGGTGTACCTTTTTGTTTTTTTGA
GACAGAGTCTTGCTCCGTCTCAGGCTGGAGTGCAGTCGTGCGATCTTGGCTCAGTGCAC
CCTCCGTCACCTGGATTTAAGCGATTCTACTGCCTCAGCCTCCTGAGTAGCTAGGACTA
CAGGGGCCCAAGTAATTTTTGTATTTTTA 

GCTGGTTTCGAACTCCTGGTCTCAGGTGATCCACCCGCCACAAAGGAGTACCTTTTATA
ACACCCAACCTAGAAGTATCAGAGAAACTTAAAAACGCGGCTCTCCCCAGACCTCCAGG
CCCTTACTTTCCGTCACAGATGACCA.CAATGAAGGTTTTTCCCCCTGGAGGGA.CAAGCC* 
CCCCAGGCTCAGGCTCTGTCATCTGCTTCCTTGATAGAATGTTGTCTTTTTCTCCATTT
TCTCCTTTACAGGTACAATC^CCTGGGAAATAAAATTTT 

GGTAAATTATCTCTAACAGTGCTGGCATGCTTAGTGGAAATGAATACTCTCTCACACAC 

ACAAAGACAGATGCACACAAAAACACGGGGCAAAGCCTCAGGTGGTGGCTG^ 

TAAAGAGAGAGAGCAGCTGGAAGTAGGAGAGGGCTGTGACTACCAAGTGCTGGGATAGA

AGGCCTGCCCCGGCGCGCCCAGCAATCTTTCCCTGCTTCGAGGTACTCAGGA.GCCGGCC 

GCCATCTGAAGCACCTCTTGGTATTTGTCAAGCTGCTTTTTGTCCCTCTCCCTAACGTC

GTCAGCAAGAGAAGGGC'CTGAAGCCGAAGAGCTCAGAGCACAAAGCACTGTAGCTGCCG 

GGTGTCCTGGCCCCTGCTGCCCGCCCTCTGTCCAGAACTCACAACCCCACGGCCCCCAG 

CCCACCCGACCTCACGCAGTGACAGCCGAAACGTCCACTACCTGGAGAGATTCCTCTGT 

TGTTCACCAAACTGCTCTCCGACATCTGAGAGGTTTTGTTTTTTGTGTGGCTATTTAAT 

GAATTAATATTTTCATCTGATGAJITTCTCCTTCTCAGGAACAGAAACA 

GGGGCACATCTGGTCTTTATCCAGCATTTGGAATCTGAATTGTTTAGAT^ 

cagccagtttacagctgactaagttttgaggaatgtctcctttattcctggaaaaatac 
aattaactatgajvcac^gccaggatttctai^ 

c^Iaaacctaatctaggaagtggtgactgc^aatacacttatgaatataagtcaaaaggt 
cotttttgggtaaap^ttttgaaaa

tagtcccagcactttggga.ggctgaggcggcaaatcac aaggtcaggagc tc cagacca 
tcctggctajvcacggtgaaaccccgtctctacta 

GGTGGC GGGC GC CTGTAGT C C C A.GCTACTCGGG AGGC TGAGGC AGGAG AAT GGTGTGAA 
CCCGGGAGGCGGAGCTTGCAGTGAGCTGAGATCGCGCCACTGCACTCCAGCCCGGGCGA 

C GGAGCAAGAC GCTGTCTCAAAJLAAAAAAAAAAJ 1 ^^ 

AGAAAAAAATAAATGACC^AAGTAAAGTTGGTTTTACAGAAATTATTGA 

AAAAGCCATTAATGTAGCTTTATAGAAAATGTTTCA^ 

TCACCATGGTTTTACTTCTTCCTTCCTTTAGGGTTCCAAAATGCGTCACCTGTGTAACT 
TCAATATCACCTTTGTTCTTCAGCAAAATCGTTCCATAGAAATGAGTGGCCTCATGAAC 
AGGGGCGGCGT CTC TGAAAAAAGAGC CC CAGAAAGACATCC C TCTGCAGC AGC AC C ACC 
CCTTCCTCGCCTGCCTCCGCTCCTGAGGATAGACTGCTGGTGGGGGCACTCAGAGGACC
ACCTCACCACAGGCAGGGCTTCCACAGACTTAGGTTTCTAAGGAGGATGGCAJ^ACCCTA 
AGGAAGAGAACGATGCACTTCATAAAGTCTAGAGAAAAGGGAGCAGCCCA.GGGGACAGC 
CACAGGCAJIGGAGTCTGAGAACGGTGTCCTGAACGTGCTCCC^^^ 

CTGCTCLAGAGTGaAGGGTTAGC^TGGCATTCTCAGTCCTCTTCTCTCaAGCCAACTCCC 
AAGGTGGTGGTGGGGGAAGCATGGGCCTTGCCTATTAGGGCTCCACTAAACTTGCTTTG
GAAAGAGTTTTGTTGCTTTGAAGTGTTTGAAAACAATGAAATC^ 

AGAAAATGAGAAGATGAGATTTTACCT^ITCCCTAAAAA^ TTCCCC TT 

CTTC TACTGAJTGCT ACAGGC TTGATTCTCCTCC CTGC C TC TC C ACAACCT C TTTC C AG A 
AGGTAAACCTGCCTCATGCCACTTCCCAGTCCGAAGCACCATTAAAAAAATGGGGTCTC
TAGAATGGCTCGCCACACTTCTGGCCCCTGCTGCAAACTCAGTGAGGTTCCCAGCTACA
GAGCAGCCCCTCCTGGCGGGTGCCTGTTTGGCTCTTCCTTCTCTGTGTGCCCTGGTGTG
ACAAGGGAGGTACAGGGGCAGAGAGTCC^CAGCGTTGCTGATGGACGGGGTGGGGATTT 
TGC^GAAAGGAGTCGTCAGATGCGACTGCCCGGTGCGTGGCTCAGCCCCCACAGGAGAA 
GCTGCGGAGCTCTTGCCACGGGAAGGCTGTGGGGCTGAGTGGAGAGCAGGAGCTCGCTT 
GGC\CATGATCTGTTTCTAGAGTGCAGAGGGGCTGCGCCACCCGGGGCC\CGAGCTGAG 
GAAC\GGCACCCCAGCTTCCCTGGCTCACCCACCCCAGCAGCTTCTACTCTGTGGTCAG 
GAGGCCTCAGTGGGGAAATGCTGTGGGGGCAGAAACAGGTCTTTTTCAAGCATTACTAG 
CCTAAAGAGAAGGAGAGTGCGGGTTCCGTGCTGTGCTGTCTCCTTAGCGGCCCTAGTGA 
GTTCCTGAAGACCCCCAGGCCACGCTTCAGGCCCCTGGTTCCTGGGCCTGATGCAAGAA 
TGGGACGCCACAGCTTCTGCCTGGGGATGAGGTCAGAGAACAGACAGGAAGCCTGAGGA 
GTCCGACTCAGACATAGGGAGGAGGTGOIGGTCTTATTTCCTGGCACCGCCAGGCTCAG 
CGGACTGGGCTGAAAGCAGAGCTCCTGTGCGTCCCAGGCTTCCGTGGTCAGATGCAGCG 
GGAGCAGTGCACGTCACATCCACGCCCCACAGGATGGGCCCTAGGCACCCTCCCAAAGG
AAAGCGTGGTCCAGTGGGGAGGGGGAACTGGGCTCCGAGCACCCAAACAAGCATCTGCC 
AGTGGGCCGAAGGGCAAGGCTCCTAGGAGGCTGAGCCCACCAGGCCCTGTCCCATAACA 
CCTCCCACAGCCCCAAGTCCACCCTCCGGCTGCTACCTTACATGGGGTGCACTGCTCCG 
CACACCCCTGTGGCTCAGTTCACACAGGGCTGTCTGTCTGCA.CCCAGAGGCCCAGCCAG 
GC^CCAGATATGGGGCTGGAAAGCAGACGCTTTCCCC\AACAGAACCTGCATTCTATCG 
GGATCAAAAATAAGCAGACTGATGGAGGAGATGTCTCAGAAGCTCACTGGTGGTGAGTG 
AGGAGTGTGAAAAGAAAATAAAGACTCGGGGCCGGGCGTGGTGGCTCACGCCTGTAATC 
CCAGCAGTTTGGGAGGCTGAGGTGGGCGGATCATGAGGTCAGGAGATTGAGACCATCCT 
GGCTAACACGGTGAAACCCCGTCTCTATTAAAAATACAAAAAATTAGCCAGGCATGGTG 
GCAGGTGCCTATAGTCCCAGCTACTCAGGAGGCTGAGGTGGGTGAATGGCGTGAACCCA 
GGAGGTGGAACTTGCAGTGAGCAGAGATCGCGCCACTGCACTCCAACCTGGGTGACGGA 
GCGAGACTCCATCTCAACGTCTCAAAAAAAAA^J\AAAAAA^ 

GAGTCGGGACCCCACTCACCATGCCAAAAGGAAAAAACTC^GCCAGAAGCTGTCATGAA 
AGAAGCTGCCTTTCCTTTGTCCCCAAGC^GAGAGCTACA^.GACAAGGTTAAACATCTCC 
ATGTTACCTTCTCTTACATCAAAGTGCTGATTTACAC^ACCAACTCTCCCTCCCTGTTC 
CTTTTCCTTTCCTCTTGCAAJITGTGTATTCAGTCATGTGACCGCACCCTCTTTCTCCTC 
CAGCCCACTTTTCTCTTTTAAATATTGAAGGCCTCJ^^ 

GAACCACAGATGGTTCCTGTGGATTTGTGGTCCTTTTTCCCAGGCATGTCCTTCACCTT 
GGCL 2 VAAGTGAACTTCTAACTTGATTGAGACCTGTCTCACATACCTTTTGGTTACAGGAG 
GAAAGGCAGGCAGGGAGGGGGTGGGTCAGAGCTGGGGCTGCCCCACAAGTAGGGAGCTC 
AGGGAAGCCTCGTCATGGCTAGCACACAAJ^GAAGAACACAG 

TTTTTTTTTTTAATTGGAGATGGAGTC TGGAGTGCAGTGGC 

GTGATCTCTGCTCACTGCAACTTCTGCTTCCCGGGTTC\AGCAATTCTCCTGCCTCAGC 

CTCCAGAGTAGCTGGGATTAAGGGTGTGCGCTGCCACGCCTGGCTAATTTTTTGTATTT 

TAGTAGAGATGGGGTTTCACCGTGTTGCCCAGGCTGTTCTCGAACTCCTGAGCTCAGGC 

AATCCGCCCGCCTTGGCCTCCCAAAGTCCTAGGATTAC^GGCGTGAGCCACTGTGGCCA 

GCCAGAAATCCLAATAATTTTAAGAACCAACTACATCCAATG 

ATGTGAAACAJVCAAAACAGAAAAATC 

GGATGAAGAGCGACCCCTAACTCCACCTTC\GTCAACCATAGAGATGGCTGAAAGCTTC 
CAAAAGACAGTCTTTTTTTTTTTTTTGAGATGGAGTGTCACTCTGTTGTGCAGGCTGGA 
GTGC\GTGGCGCGATCTCGGCTCACTGC^AGCTCCGCTTCCTGGGTTCACGCCATTCTC 
CTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGCGCCGGCCACCACGCCTGGCTAATT 
TTTTGTGTTTTTAGTAGAGATGGGGTTTCACCATGTTAGCCAGGATGGTTTCAATCTCC 
CGACCTTGTGATCCACCCACCTCGGCCTCCCAAA.GTGCTGGTATTACGGGCGTGAGCCA 
CCGCGCCCGGCAAGACAGTCTTCTTTTTTTTGAGACAGAGTCTTCCTCTGTCACCCAGG 
Figure 61t

CTGGAGTGCAGTGGTGCAATCTCGCCTCACTGCAACCTCTGCCTCTTGGGTTCAAGTGA 
TTCTCTTGCCTCAGCCTCCCAAGTAGCTGG^ 

TAATTTTTGTATTTTTAGTAGAGACAGCK5TTTCTTCATATTGGCCAGGCTGGTCTCGAA 
CTCCTGACCTCATGATCTACCCGCCTCAGCATCCCAAAGCGTTGGGATTACAGGCGTGA 
GCCACCATGACCAGCCTCCTTTTCCTTTCTCTTTTTATTTTTTAAGACAGAGCCTTGCT 
GTGTTGCCCAGGCTAGAGTGCGGTAGC^CGATCACAGCTCGCTGCAGCCTCAAGCTCCT 
AGGCTCAAGCAATCTTCCTGCTTCAACCTCGTGTGTAGCTGGGACCGGAGGTGCACACC 
ACCATGCTCGGCTAATTTTTTTTTTTTTTTTTTTTT^ 

CCCAGGCTGGAGTGCAGTGGCGCGATCTCGGCTa\CTGC^GCTCCACCTCCCGGGTTC 
ACGCCATTCTCTTGCCTCAGCCTCCCAAGCAGCCGGGACTACAGGTGCCCGTCACCACG 
CCCGGCTAJ^TTTTTTGTATTTTTTTAGTAGACACGGGGTTTCACCGTGTTAGCCAGGAT 
GGTCTCGATCCCCTGACCTCATGA.TTC^CCCGCCTCGGCCTCCCAGAGTGCTGAGATTA 
CAGGCGTGAGCCACCGTGCCCGGCCTCGCTTGGCTAATTTTTTAATGTTTTGTAGAGAT 
GGGGTCTCACTATGTTGCCCAGGCTGGTCTCAAATTCCTGGGCTCAGGCAATTCTCCTG 
CCACGGCCTCCTGAAGTGCTAGGGATGCTCTCCTCTTACCCCAACAACTCAGGGCTTGA 
AATGTCTACTATTTGGCTTATTAAAGTAA 

CACATCCCAAAGAAGAAAAGCGTTTTCTTTTTTTGAGACGGAGTTTTACTTGTCGCCCA 
GGCCTGAGTGCAGTGGCGCAATCTCAGCTCACTGCAACCTCTGCCTCTCAGGTTCAAGC 
AATTTTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCGAGCACCACCACGCCCA 
GCTAATTTTGTACTTTTAGTAGAGACAGAGTTTCACCACGTTGGTCAGGCTGGTCTTGA 
ACTCCCAACCTCAGGTGATCCGTCTGCCTCAGCCTCCCAAAGTGCTAGGATTACAGGCG 
TTAGCCACCGCACCTGGCCAGAAAAGCATTTTCCT 

AAAGTGCCCTCCAGACATAGCATGACTGGCTGCTCTCCACCACCACTGCATTGACCACG 
TCACCTCTCCGGGGTGTGTACCCATCTGGCAGTTTCAAGGAGTCCAAGGTAAAGAAGAT 
GCTTTCCTCTAACACCCCGTTCCTTCCACAGAGGCTAGAGATGCAGACCTGGGAGCAGA 

GAGCTGAGCGTCACAGGAAGCAGATGCTGCATGACGACAGGGCGCAGCTCTAACACACG 

CCCAAGTCAGCCCAAAGCA(1^CAAGCTGCACCAGGAAGCTCAAGTCCGCCATCCCGTAG 
CACTGGTCC^GTGATTCTCCAACACACCTTTCTCCTTAGAACATTTTAGCACTGTTGCA 
TAAGCTACAGACCTTAGAATTCAGGTATGCAAGCACTAGGAGTTATTGTTTCCAAACGA 
AACACAGCATTGTCAATAGGAAAACACACTCCTCTTTGGCCATGACAAAGCTTTATTTT 
TCCAGGCTTCCAACACATGCAGGAGAAGCCTGGGCCGTGCAAGTTACCCCTGATGGCAG 
GTCTGCCAGAAGCACAGAGAGGAGCCACTAGTCGGCACGCTACCTTGTCCACGCGCTTG 
TATCTCAGTGGCTTCACTGAGGTGGCTTCGCTGCTCCACGTGCCAGGCCGGATCCGGTA 
CTCAGCCTCCACCCAGTCTCCCTTGCAGGGCTCGAAGCCT 

Figure 62 

SEQ ID NO.: 62 hSPG18 cDNA sequence

ccccataccgcgaactttgtagctggtgccttcggaatatgATGGCAAATC\CCTTGTA 
AAGCCTGATAATAGAAATTGCAAGAGGCCAAGAGAATTGGAGTCTCCAGTGCCAGATAG 
TCCACAGCTGTCCTCTCTTGGAAAATC\GATTCATCTTTCTCTGAAATTTC 
TTTATAAAGATGAAGCCTTGGAGAAAGATTC^ 

ATGTTGTCTACCTATGCAAAGCTTTTAAGTGAGAGAGCAGCAGTAGATGCATCTTACAT 
TGATGAGA-TAGATGAACTCTTCAAAGAAGCCAATGCTATTGAAAACT^ 
AAAGAGAGTTCCTGCGACAGAGGTTTACAGTGATTGCAAACAC\TTACACAG 
atatacttgaaataagctgagaatttaacctattattgttataatgaaagaatgacatt 

tatgctttgaaagctctcgagttgtt 

SEQ ID NO.: 63 hSPG18 encoded protein sequence Figure 63 a 
MANHLVKPDNRNCKRPRELESPVPDSPQLSSLGKSDSSFSEISGLFYKDEALEKDLNDV

SKEESILMLSTYAKLLSER^^^ 

TLHR 

SEQ ID NO.: 64 hSPG25 cDNA sequence Figure 64a

CTTCAAGATTATCAATAATCGGAGATACGTATATTTTATTTGTAAA.GAAAACATGGCTG 

CCCTATTCCTACGTGGTTTTGTCCL\AATAGGGAACTGCAJIGACTGGGATATCTAAGTCA 

AAAGAAGCATTCA.TTGAAGC\GTGGAJ\AGAAAGAAGAAAGATAGACTGGTGCTGTATOT 

CAAAAGTGGAAAA.TATAGCACTTTTCGGCTAAGTGATAATATTCAAAATGTAGTCCTO 

AATCCTATAGAGGAAJ^CCAJIAATCACCTGCATTTAACTTTACAAAATAATAATGG 

TTTATTGAAGGATTATCCTCCACAGATGCTGAACAATTGAAGATATTCTTGGACAGAGT 

TCATCAAAACGAGGTTCLAGCCACCTGTGAGACCTGGTAAGGGTGGGAGTGTCTTTTCTA 

GCACAACACAGAAGGAJ^TCAACAA^CTTCATTCCACAAAGTTGATGAG 

AGCAAATCTTTT5AGATAGCAAAAGGAAGTGGGACAGGTGTCCTTCAGAGGATGCCTTT 

GCTTACATCAAAATTGACACTTACTTGCGGAGAGTTATCAGAAAATCAGCACAAGAAG 

GGAAAAGAATGCTCTCATCTAGCTCAGAGATGAATGAGGAATTCTTGAAAGAAAATAAT 

TCTGTAGAATACLAAGAAATCCAAGGCAGATTGTTCGAGGTGTGTAAGCTATAATCGAGA 

GAAACAATTGAAGTTAAAAGAGTTAGAAGAGAATAAGAAJVTTGGAATGTGAATCTTCAT 

GCATCATGAACGCCACTGGAAATCCTTACCTAGATGACATTGGTCTTCTCCAAGCTCTC 

ACTGAGAAAATGGTTTTGGTATTTCTGTTACAACAAGGGTATAGTGACGGTTACACAAA 

GTGGGATAAATTAAAACTATTTTTTC^ 

CCAATTTGGGAAACACCTGTTATATGAATGCAGTGTTACAGTCTCTACTTTCAATCCCA 
TCGTTTGCTGATGATTTACTTAATCAGAGTTTCCCATGGGGTAAAATTCCCCTTAATGC 
TCTTACCATGTGCTTGGCACGGCTACTTTTTTTTAAAGATACCTATAATATAGAAATCA 
AGGAGATGTTACTCTTGAATCTTAAAAAGGCCATTTCAGCAGCTGCAGAGATATTCCAT 

GGCAATGCAGAGAACGAyGCTGATGAGOT " 

TAACATGGAAAAACTCAACACAATTTGGAAGCCTAAAAGTGAATTTGGGGAAGATAATT 
TTCCTAAACAGGTTTTTGCTGATGATCCTGACACCAGTGGGTTTTCTTGCCCTGTCATT 
ACTAATTTTGAGTTAGAGTTGTTGCACTCCATTGCTTGTAAAGCTTGTGGTCAGGTTAT 
TCTCAAGACAGAJ^CTGAJVTAATTACCTCTCCATCAAC 

ATCCTTCATCTATTCAGTCTACTTTTGATCTTTTTTTTGGAGCAGAAGAGCTTGAGTAT 
AAATGTGCAJ^TGTGAGCACAAGACTTCCGTTGGAGTGCACTCATTCAGTAGGCTACC 
TAGAATCCTTATTGTTCACCTCAAACGCTATAGCTTGAATGAGTTTTGTGCATTAAAGA 
AGAATGACCAGGAAGTCATC^TTTCCAAATATTTAAAGGTGTCTTCTCATTGCAATGA^ 
GGCACCAGACCACCTCTTCCCTTGAGTGAGGATGGAGAAJITTACAGATTTCCAATTATT 
AAAAGTTATTCGAJIAGATGACTTCTGGAJ^AC^ 

AATCCAAAGATATCCTGGCTCCAC^CATTGGATCAGATAAGGAGTCTGAACAAAAAAAA 
GGCCAGACAGTCTTTAAAGGGGCAAGCAGAAGACAGCAGCAAJLAGTACCTTGGAAAAA^ 
TTCTAAACCAAATGAGCTAGAJVTCTC 

AACCGTTAGCTCACTTAATGACGTATCTGGAAGATACCTCACTTTGTCAGTTCCACAAA 

GCTGGAGGTAAACCTGCCLAGCAGCCCAGGC^CACCTCTCTCAAAAGTTGACTT^ 

AGTGCCCGA^J^TCCAAAJ^CGAAAGAAATATC^ 

ATAGGATTATCAJVTCCTACTAAAGATTTGTATGAAGATAAAAATATCAGAATTCCAGAA 

AGATTCCAAJVAAGTGTCTGAACAGACTCAGCAGTGTGACGGTATGAGAATCTGTGAACA 

AGCCCCTCAGCAGGCACTGCCTCAAAGCTTTCCAAAGCCAGGCACCCAGGGGCACACAA 

AGAACCTCCTAJ^GACCTACAAJ\ATTAAATCTAC\GAAGTCTAACAGGAATTC 

GCACTGGGTTCCAATAAGAJVTCCAAGAAACAAAGACAT^^ 

AGCCAAGGAJ^CAAAAAGAAATGATGATAAGGGAGATCATACCTACCGGCTCATTAGTG 
Figure 64b

TTGTCAGCCATCTTGGGAAGACTCTAAAGTCAGGCC\TTATATCTGTGATGCCTATGAC 
TTTGAGAAACAGATCTGGTTCACTTACGATGATATGCGGGTGTTAGGTATCCAGGAGGC 
CCAGATGCAGGAGGATAGGCGTTGCACTGGGTACATCTTCTTTTACATGCATAATGAGA
TCTTTGAAGAGATGTTGAAAAGAGAAGAGAATGCCCAGCTTAATAGCAAGGAGGTAGAG 
GAGACCCTTC^GAAGGAATAA 

Figure 65 

SEQ ID NO.: 65 hSPG25 encoded protein sequence 

MAALFLRGFVQIGNCKTGISKSKEAFIEAVSRKKKDRLVLYFKSGKYSTFRLSDNIQNV
VLKSYRGNQNHLHLTLQNNNGLFIEGLSSTDAEQLKIFLDRVHQNEVQPPVRPGKGGSV 
FSSTTQKSINKTSFHKVDEKSSSKSFEIAKGSGTGVLQRMPLLTSKLTLTCGELSENQH 
KKRKRMLSSSSFJ4NEEFLKZNNSVEYKXSKADCSROT 

SSCIMNATGNFYXiDDIGLLQALTEKMVLVFLLQQGySDGYTKWDKLKLFFELFPEKICK 
GLPNLGOTCYMNAVLQSLLSIFSFADDLLNQSFPWGKIPLNALTMCLARLLFFSDTXIfl 
EIKEMLLLI^KKAISAiAEIFHGNAQlSroA^ 

DOTPKQWADDPDTSGFSCPVITNFELELLHSIACKACGQVILKTELNNYLSINLPQRI 
KAHPSSIQSTFDLFFGAEELEYKCAKCEHKTSVGVHSFSRLPRILIVHLKRYSLNEFCA 
LKKNDQEVTISKYLKVSSHCNEGTRPPLPLSEDGEITDFQLLKV'IRKMTSGNISVSWPA 
TKESKDILAPHIGSDKESEQKKGQTVFKGASRRQQQKYLGKNSKPNELESVYSGDRAFI 
EKEPLAHLMTYLEDTSLCQFHKAGGKPASSPGTPLSKVDFQTVPENPKRKKYVKTSKFV 
AFDRIINPTKDLYEDKNIRIPERFQKySEQTQQCDGMRICEQAPQQALPQSFPKPGTQG 

HTKISnLLRPTKLNLQKSNP^SLLALGSNKNPRNKDILDKIKSKA^ 
ISWSHLGKTLKSGHYICDAYDFEKQIWFTYDDMRVLGIQEAQMQEDRRCTGYIFFYMH 

NEIFEEMLKREENAQLNSKEVEETLQKE 

Figure 66 

SEQ ID NO.: 66 hSPG27 cDNA sequence

TACGAATTTAATACGACT.CACTATAGGGAA^TrGGCCCTCGAGGC 

GAGGGCCCGGCTGCCACCCTGTCTGAGAAGTGAGGAGCCTCTCCGCCCGGCAGCCACCC 
CATCTGGTTGAATTAAAGAAAATACTTTATCAGAAGAAGATGGCCACTGCCCAGTTGCA 
GAGGACTCCCATGAGTGCACTGGTATTTCCCAATAAGATATCAACTGAACACCAGTCTT 
TGGTGTTAGTGAAGAGGCTTCTAGCAGTTTCAGTATCCTGTATCACGTATTTGAGGGGA 
ATATTCCCAGAATGCGCTTATGGAACAAGATATCTAGATGGATGCTAGGATGTTATGAT 
GCTTTACAGAAAAAATATCTAAGGATGGTTGTTCTAGCTGTATACACAAACCCAGAAGA 
TCCTCAGACAATTTCACCATTCTGATGTTGGAGCGGCCGCAAGCTTATTCCCTTTAGTG 

AGGGTTAATTTTAGCG 

Figure 67a 

SEQ ID NO.: 67 hSPG34a cDNA sequence 

AGCCGCCGCTGTCGTCCACCATGGTGGTGCTCCGAGGACCCCACCGCTGCCGCCACTGC 

CC\CGCCGATGCCGCTACCCGCTCCGGGCCCCTGGCAAACCCACTGCTTTTCCCCTCCT 

CCCAGCGCCCGCGCTGCCCGCCCTGC^CCCGCTGTCGCAGGTGC.aACGG'TACCACCACC 

CCAGCTGTGAGGCTGCCATCAACACCCACATCAGCCTGGAGCTCCACGCATCCTATGTG 

TACCTGTCCATGGCCTTCTACTTCGACCAGGACGACGCGGCCCTGGAGCACTTTGACTG 

CTACTTCCTGTGCCAGTTGCL^GGAGA^AGGGAGCACGCCCAGGAGCTGATGAGGCTGC 

ACAACCTGCGCGGTGGCCGCATCTGCCTTC^TGACGTCGGGAAGCCAGAGGGCCAAGGC 

TGGGAGAGCGGGCTCAAGGCCATGGAGTGCGCCTTCCACCTGGAGAAGAACATCAACCA 

GAGCCTCCTGGAGCTGC\CCAGCTGGCC\AGGAGAACGGCGACCCCCAGCTTTGCGACT 

TCCTGGAGAACCLACTTCCTGAACCAGCAGGCCAAGACC\TCAAAGAGCTGGGTGGCTAC 

CTGAGCAACCTGCGCAAGATGGGGTCCCCGGAAGCAGGCCTGGCAGAGTACCTCTTTAA

CAAGCTC\CCCTGGGCCGa\GCC\GA?ACACACCTGAGCCCAGAC\GGCCCCTCAG^ 
Figure 67b

TGGGGTGCCTTCCCCTGCTCGCGCCACC^GGCGGGACGTCCATGTTGCCTTTTCAGAAC 
ATTCTCTTCATTTTTCTCCTCTCAGTTTGACCATTGGTAACAATAAAGTTATCTGTTCT 

SEQ ID NO.: 68 hSPG34a encoded protein sequence Figure 68

MVVLRGPHRCRECPRRCRYPL^^ 

NTKISLELHASYVYLSMAfTFDQDDAALZH^ 

ICLHDVGKPEGQGWESGLKAMECAFHLEKN^ 

NQQAKTIKELGGYLSNLRKMGSPEAGLAEYLFNKLTLGRSQKHT

SEQ ID NO.: 69 hSPG34b cDNA sequence Figure 69 

GGCCACCCGCCTTTCACTATCCGCCATTCTTGTCACCTCAGCTGCTGCCCTCGCTACCG 

CACCGACTTCGCCCGTGTGCTCGCCTGC^CTTGCGCTGCCCGCCATGGCCACCGCCCAG 

CCGTCGCAGGTGCGCCAGAAGTACGACACCAACTGCGACGCCGCCATCAACAGCCACAT 

CACGCTGGAGCTCTACACCTCCTACCTGTACCTGTCTATGGCCTTCTACTTOIACCGGG 

ACGACGTGGCCCTGGAGAACTTCTTCCGCTACTTCCTGCGCCTGTCGGACGAC^AAATG 

GAGCATGCCCAGAAGCTGATGAGGCTGCAGAACCTGCGCGGTGGCC^CATCTGCCTTCA 

CGATATCAGGAAGCCAGAGTGCCAAGGCTGGGAGAGCGGGCTCGTGGCCATGGAGTCCG 

CCTTCCACCTGGAGAAGAACGTCAf^CCAGAGCCTGCTGGATCTGTACCAGCTGGCCGTG 

GAGAAGGGCGACCCCCAGCTGTGCCACTTCCTGGAGAGCCACTACCTGCACGAGCAAGT 

CAAGACCATCAAAGAGCTGGGTGGCTACGTGAGCAACCTGCGCAAGATTTGTTCCCCGG 

AAGCCGGCCTGGCTGAGTACCTGTTCGACAAGCTCACCCTGGGCGGCCGCGTCAAAGAG 

ACTTGAGCCCAGATGGGCCCCACAGCCACGGGGTCCCTTCCCTGGGTCAGGCCACTAGG 

CGGGGCGTGCATGTTGCCCTTTCAGAACGTTCTCTTCAGTTTTATCTTTCAGTTTTACC 

ATTGTTAGCAAAAAAGTTATC^ 

AAAAAAA 



SEQ ID NO.: 70 hSPG34b encoded protein sequence Figure 70

MATAQPSQVRQKYDTNCDAAINSHIT^ 

SDDKMEHAQKLMlLQNLRGGHICJliHDIHXPECQGWESGLV 

YQIAVEXGDPQLCEFLESHYLHEQVCT^ 

GRVKET 

SEQ ID NO.: 71 hSPG39a cDNA sequence Figure 71a 

GGGAGAGAGATCTTCCTCTCTCTTCGGGCGTGTTAAGACAGCGGGGTTGGCCTGTACTT 

CCTCTGGCCCTGGCTGAAGAGGGCTAGTGAAACCGTTAAACCCCTAGGCGATCATGGCC 

TTGAGACCTGAGGACCCCAGTAGCGGGTTCCGGCATAGCAACGTGGTGGCCTTCATCAA 

CGAGAAAATGGCCAGGCACACGAAAGGCCCCGAGTTCTATCTTGAGAATATATCCTTAT 

CCTGGGAGAAGGTGGAAGACAAGCTGAGGGCCATACTGGAGGACAGCGAGGTGCCCAGT 

GAGGTCAAAGAGGCCTGCACCTGGGGCAGCCTGGCCTTGGGTGTGCGCTTTGCCCACAG 

GCAGGCACAGCTACAAAGGCAC^GGGTGCGGTGGCTGCACGGCTTCGC 

AATCAGCCGCACAGGCCTTGGCATCAGACCTGAAGAAGCTCAGGGAGCAGCAGGAGACG 

GAACGCAAGGAGGCGGCCTCCCGGCTAAGAATGGCCCAGACCAGCCTCGTGGAGGTGCA 

GAAAGAGAGAGACAAGGAGCTGGTGTCTCCCCATGAGTGGGAGCAGGGGGCAGGGTGGC 

CL\GGCCTGGCCACTGCCGGAGGGGTTTGCACAGAAGGAGCAGCTGAGGAGGAAGAAGAG 

GCGGCGGTGGCTGCTGCTGGTGCTGCTGGAGGAAAAGGAGCAGAAGAAGAGC^GAGGGA 

TGTGGAGGTTGTGGCTGCCCCTGTGGAGGCCATGGCTCCCCCTGTGGAGGCTGGGGCTG 

CCCCCATGGAGACCCAGTTCCCCCACGTGGAGGCCAGGGCTGCCTCCATGGAGACCACA 

GAGAAGCTGGAGAGAATCCTCCTGCAGCTCCTTGGAGATGCTGATCAGGAAAAGTACAC 

CTATTGGGGGCAGAAGGAGGGAGATCTCCGGTCGGTCGAAACAGCCACATCTTATTTCT 
Figure 71b

CTGGAACCACGAACCCCTGGTCC\GAGCCTC\TCAGAACCTCTTCCTGTCCAGCT(J(JCT 
GCCTCATACTCATACTCATACTCAAGCCCTTTTTCCTCCTTCTCAGACATACCCACTAT 
ATCCCCTCCACAAGCAACAGTCACAGCACCAGTTCCGCCTCAGCTGCCTTCCGACTGGG 
AGGCCTTTGATACTAGCCTGTGGTCTGATGGGGGGCCCCACAGAATAGACCATCAGGAG 
CACCCAAGAGACAGGAGATACTCCGAACCTCATCAGCAAAGACCTCCAGTATATCGCAG 
GCCAGGGGACTGGGACTGCCCTTGGTGTAACGCTGTGAATTTTTCACGGAGGGATACTT 
GCTTCGACTGTGGGAAGGGAATCTGC-CTGCAAAAACCTCATTGAGTGCAGAAATGCAAA 
ATAGAACCGAAGCATGTATAAAAAAA 

Figure 72 

SEQ ID NO.: 72 hSPG39a encoded protein sequence 

MALRPSDPSSGFHKSNWAFIlTOKMJ^i^TKGPSr'/LENISLSWEKTEDKLRAILEDSE^ 

PSEVKEACTWGSLALGVRFAI^QAQLQRHRWWLHGFAKLHKS^ 
ETERKEAASRLRMAQTSLVWQKE^ 

EEAWAAAGAAGGKGAEEEQP^VEV^/AAPVEAMAPPVEAGAAPMETQFPHVEARAASME 
TTEXLERILLQLLGDADQEKYTYWGQKEGDLRSVETATSYFSGTTNPWSRASSEPLPVQ 
LPASYSYSYSSPFSSFSDIPTISPPQATVTAPVPPQLPSDWEAFDTSLWSDGGPHRIDH 
QEHPRDRRYSEPHQQRPPVYRRPGDWDCPWCNAVNFSRSDTCFDCGKGIWLQKPH 

Figure 73 a 

SEQ ID NO.: 73 hSPG39a genomic DNA sequence 

GGGAGAGAGATCTTCCTCTCTCTTCGGGCGTGTTAAGACAGCGGGGTTGGCCTGTACTT 
CCTCTGGCCCTGGCTGAAGAGGTGAGGCCTGGTGGGAGGTGTCCTAGGGTAGGACAAGC 
CGGTCAGGGGGTCATTAGGACGGTCTTGTCAGAGCGGGTAGGGCGGGGACAAGAGGGCG 
GGAGAAGATGGATGAGGGGAGGGGCTAAGGGGGAGGAAAGGAACCTATTGGCTGCTCCA 
TCCACACAGGGCTAGTGAAACCGTTAAACCCCTAGGCGATCATGGCCTTGAGACCTGAG 
GACCCCAGTAGCGGGTTCCGGCATAGCAACGTGGTGGCCTTCATCAACGAGAAAATGGC 

CAGGCAC\CGAAAGGGCCCGAGTTCTATGTTGAGAATATATCCTTATCCTGGGAGAAGG

TGGAAGACAAGCTGAGGGCCATACTGGAGGACAGCGAGGTGCCCAGTGAGGTCAAAGAG 
GCCTGCACCTGGGGCAGCCTGGCCTTGGGAGTGCGCTTTGCCCACAGGCAGGCACAGCT 
ACAAAGGCACAGGGTGCGGTGGCTGG^CGGCTTCGCCAAACTGCACAAATCAGCCGCAC 
AGGCCTTGGCATCAGACCTGAAGAAGCTCAGGGAGCAGC^GGAGACGGAACGCAAGGAG 
GCGGCCTCCCGGCTAAGAATGGCCCAGACCAGCCTCGTGGAGGTGGAGAAAGAGAGAGA 
CAAGGTGAGTTGGAAGCCGCTCCATGC\GTAAGATCCCTCAACTGGTCCCTGCCCAGTA 
CCACTGCCCTGCCCCATTCCCACCCCTCTCCACCCTGCTCCATGGCTTCGCCCTGCCCC 
GCCCTTCCCACCTGGTAGCTCCGTCCTACCCTGCTTAGTGCTCCCGCCTTGCCCCCAGA 
ACACACCTCAGCCCTGCCCACTTCTCTCCAGGAGCTGGTGTCTCCCCATGAGTGGGAGC 
AGGGGGCAGGGTGGCC\GGCCTGGCC^CTGCCGGAGGGGTTTGCACAGAAGGAGCAGCT 
GAGGAGGAAGAAGAGGCGGCGGTGGCTGCTGCTGGTGCTGCTGGAGGAAAAGGAGCAGA 
AGAAGAGCAGAGGGGATGTTGGAGGTTGTGGCTGCCCCTGTGGAGGCCATGGCTCCCCC 
TGTGGAGGCTGGGGCTGCCCCCATGGAGACCCAGTTCCCCCACGTGGAGGCCAGGGCTG 
CCTCCATGGAGACCAC\GAGAAGCTGGAGAGAATCCTCCTGCAGCTCCTTGGAGATGCT 
GATCAGGAAAAGTACACCTATTGGGGGCAGAAGGAGGGAGATCTCCGGTCGGTCGAAAC 
AGCCACATCTTATTTCTCTGGAACCACGAACCCCTGGTCCAGAGCCTCATCAGAACCTC 
TTCCTGTCCAGCTCCCTGCCTCATACTCATACTCATACTC^AGCCCTTTTTCCTCCTTC 
TCAGACATACCCACTATATCCCCTCCACAAGCAACAGTC^CAGCACCAGTTCCGGCTCA 
GCTGCCTTCCGACTGGGAGGCCTTTGATACTAGCCTGTGGTCTGATGGGGGGCCCCACA
GAATAGACCATCA.GGAGCACCCAAGAGAC a .GGAGATACTCCGAACCTCATCAGCAAAGA 
CCTCCAGTATATCGCAGGCCAGGGGACTGGGACTGCCCTTGGTGTAACGCTGTGAATTT 
Figure 73b

TTCACGGAGGGATACTTGCTTCGACTGTGGGAAGGGAA.TCTGGCTGCAAAAACCTCATT 
GAGTGCAGAAATGCAAAATAGAACCGAAGCATGTATA 

Figure 74

SEQ ID NO.: 74 hSPG39b cDNA sequence

TCTCTTCAGGCGTGTTAAGCAGCGGGGTTGGCCTGTACTTCCTCTGGCCCTGGCTGAAG 

AGGGCTAGTGAAACCGTTAJIGCCCCTAGGCGATC^TGGCCTTGAGACCTGAGGACCCCA 

GTAGTGGGTTCCGGCACGGAAACGTGGTGGCCTTCATCATCGAGAAAATGGCCAGGCAC 

ACGAAAGGCCCCGAGTTCTACTTCGAGAATATATCCTTATCCTGGGAGGAGGTGGAAGA 

CAAGCTCAGGGCCATCCTGGAGGACAGCGAGGTGCCCAGCGAGGTCAAAGAGGCCTGCA 

CCTGGGGCAGCCTGGCCTTGGGTGTGCGCTTTGCCCACAGGCAGGGGCA.GTTA 

CGC^GGGTGCAGTGGCTGCAAGGCTTTGCCAAACTGCACAGATCAGCTGCGCTGGTCTT 

GGCCTCAAACCTGACGGAACTCAAGGAACAGCAGGAGATGGAATGCAJVTGA^ 

TCCAGTTGCAGCTAACCGAGACCAGCCTTGCGGAGGTGCAGAGAGAGCGGGACATGCTG 

AGATGGAAC^TCI^CCATGCCGAGCTGGCACCTCCCCAGGGACAGGGCCLAGGCTACAGT 

GTTTCCLAGGCCTGGCCACTGCCGGAGGGGATTGGACAGAAGGAGCAGGTGAGCAGGAAA 

AGGAGGCGGTGGCTGCTGCTGGTGCTGCTGGAGGAAAAGGAGAGGAGAGGTATGCAGAG 

GOIGGGCCTGCCCCCGCAGAGGTCTTGC^GGGGCTGGGAGGAGGCTTCAGGCAGCCCCT 

CGGAGCTATTGTAGCAGGCAAATTACA^ 

TCAGTACAAACAGCCATGTCTGTCTTCTCTGGGCTTGGGTCCACAGTCTCACTGGAGCC 
TCTTCCTGTCCAGCTCGCTACCTCATTCACATACTCATACCCA.TGCCCTTTGTCCGCCT 
TCTCAGCCATACCCAATATACCCCCTTC^CCAGCAJ^GGTCACAGAACGGGTTCCAACT 
CAGATGCCTTTCAACTGGGGGGCCTCTGATGCTAGCCTGTGGTCAGATGTGGAGGCCCA 
GGGAATAGACCCTCAAGAGCCCCCAAGAGAC^GGAGAGACTCCGAACTCCATCAGCAGA 
GAAGACCTCCAGTATATCGCAGGCC^GGGAACTGGGACTGCCCGTGGTGTA^AAGCTGTG 
AATTTTTCATGGAGGGAAAATTGCTTCCTCTGGGGAGGCGAATACGGCTGCAAAAGCCT 
CAGTAAAT 

Figure 75a 

SEQ ID NO.: 75 hSPG46 cDNA sequence

CGGCGAAAGTCCAGTATGTGGGTCCAGGGTCACTCTTCTAGAGCTTCCGCAACGGAAAG 

TGTGAGTTTTTCAGGAATTGTTCAGATGGATGAAGATACLACATTACGATAAAGTGGAAG 

ATGTGGTTGGAAGTCACATAGAAGATGCAGTAACATTTTGGGCCCAGAGTATCAATAGA 

AATAAGGATATCATGAAGATTGGTTGCTCACTGTCTGAAGTTTGCCCCCAGGCCAGTTC 

AGTTTTGGGGAATCTTGACCCAAACAAGATTTATGGTGGATTATTTTCTC 

GTTGGTACAGATGCW.GTACTGAAAATC\TCAGCGTTGAAAAGTGTCTGGTGAGGTA 

ATTGACTATGGAAATACTGAAATTCTAAATCGATCTGATATAGTTGAAATTCCTTTGGA 

GCTGCAGTTTTCTAGTGTTGCCAAAAAGTATAAACTTTGGGGACTACACATTCCTT 

ATCAAlGAAGTTACCCAGTTTGATCAGGGC^CAACCTTTTTGGGGAGCTTGATTTTTGAA 



TGAGTATGGCAGTGTGGATATAGGGGAAGAGGTC-CTTAAGAAJ\GGATTTGCAGAGAAAT 

GCAGACTTGCTTCCAGAACTGACATCTGTGAGGAAAAJ^AAATTGGATC^ 

GTTCTCAGGAACCTCAAAAGCCCCATTCCTTTGTGGGGGCATAGATCAAACCAGTCAAC

CTTCAGC^GGCCCAAGGGGCACTT^^ 

ATGATGC\GGCAATCTTATAACATTC^ 



TGAAAAACTTAAJ\ACA&AGAAGGACGCTCTTCTTC 

AAGTAGAGCAGATTGCCCAGGAGCTGCAGCAAGAGAAGGCAGCTGCTGTGGATTTGACT 
AACCACTTAGAATAC\CTCTGAJIGACCTATATAGATACCAGAATGAAAJ^TCTGGCA^ 
Figure 75b

TAAGATGGAAATACTGAAAGA^^ 

ACCTTTCAGATGCTATACAAGTGTTGGATGAAGGGTGCTTTACTACTCCAGCTTCTTTG 
AATGGATTAGAGATAATATGGGCAGAATA*C\GTCTGGCTC\GGAGAJVTATTAJVA & CTTG 
TGAATATGTGAGTGAJ\GGGA^\T ATTTTGATTGCCC AAAGAAATGAA^ TGC ^GCAGAAGC 
TGTACATGTCAGTAGAAGATOT7ATO 
TTAAJ^CATTGCAGGATTTGTCAGTCTC 

AGGAGCAAATTCTGATGAAATACTTAAAAAAT?TTATGA£TGGAAGTGTGATAAAAGAG 
AGGAGTTCACCAGTGTTAGAJVGTGAAAC^GACGCTTCTCTGCACCGTCTTGTAGCATGG 
TT C CAAAGAAC CTTAAAGGTTTTTGACCTATCTG TGGAAGGATCAC TGATTTC AG AAGA 

CGCAATGGATAATATTGATGAAATCCTAGAGAAGACTGAGTCAAGTGTCTGCAAAGAGC 
TGGAGAT AGC TCTGGTTGATC AJ^GGTGATGC AG ACAAGGAGATAAT'TTC AAAT A.CATAT 

AGTCAAGTACTGCAAAJVGATTCATTCAGAGGAAAGGCTCAOTGCCACAGTACAAGCTAA 
GTAC^GGACAGTATTGAGTTTAAAAAGCAGCT^ 

GTGTGGATCAC I^G CTATCCATTAAGAAGAC ATTGAAAAGCTT AAAAGCTC TACTC AGA 
TGGAAATTGGTTGAAAAGAGTAATTTGGAAGAGTCAGATGATCCTGATGGCTCTCAAAT 
TGAGAAAATAAAAGAAGAAJ\T AACTCAGCTGCGCAATAATGT C TTTCAGGAA^ 
ATGAGAGAGAGGAATATGAG ATGCTAACTAGTTTGGCACAGAJ^ATGGTTCC CT GAGCTG 
CCTCTGCTTCATCCTGAAATAGGATTACTCA^^ 

AATGAGCTTGGAACGAGATCTTCTTGATGCTGAGCCCATGAAGGAACTTAGCAGCAAGC 
GTCCTTTGGTACGTTCTGAGGTTMTGGGCAGATAATTCTGTTAAAGGGCTATTCTGTG 
GATGTTGACACAGAJ^GCCAAGGTGATTGAGAGAGCAGCCACCTACCATAGAGCTTGGAG 
AGAAGCTGAAGGAGACTCAGGGTTACTGCCATTGATATTCCTGTTTTTATGTAAGTCTG 
ATCCTATGGCTTATCTGA.TGGTC CCATACTACCCI'AGGGCAAACCTGAATGC TGTTCAA 
GCCAACATGCCTTTAAATTCAGAAGAAACTTTAAAGGTCATCAAAGGTGTTGCCCAGGG
TCTGCATACATTGCATAAGGCTGACATAATTCATGGAT^ 

TTGCTTTAAACCGTGAACAAGGAATTGTTGGAGATTTTGACTTCACCAAATCTGTGAGT 
CAGCGAGCCTCGGTGAACATGATGGTTGGTGACTTGAGTTTGATGTCACCTGAGTTG^ 
AATCGGAAAACCTGCTTCTC^ 

GC^TTTCTGTTCAAAATC^.GGAGTTTGAGATAAATAAAGATGG 

CAGTTTCATCTGGATGATAAAGTCAAATCCCTCCTCTGTAGCTTGATATGTTATAGAAG 

TTCAATGACTGCTGAACAAGTTTTAAATGCTGAATGTTTCTTC 

CAGTTCCAAACCCAGAAAJ\AGATACTGAATACACCCTA^ 

AJ^GAC GGAGAACTTGG ATAAATGTATGGAGAAGC CAAGAAAT GGTGAAGC C AAC TTTGA 
TTGTTAAATTATTA^ 

GTTAATACAC\GAAA?ATCTAGAAATGTTCTGGGACTAGTTGAGTTGTATCTTTAGTAT 
TCAGGTTGTGAAAAATAAA.GATGTTTGGCTATGCAAAAAAA 

SEQ ID NO.: 76 hSPG46 encoded protein sequence Figure 76a
MWVQGKS SRA.S ATE SVSF SGIVQMDEIJTHYBKVED WGSHIEDA VTFWAQS 
KIGCSLSEVCPQAS SVT.GNLDPNKIYGGLF SEDQC/TfRCKVTiKI I SVEXCLVRYIDYGN 
TSII^SDIVEIPLZLQFSSVAKKY^^ 

RIKAT S EDGTVI AQ AZYG £VD 1G2EVLXXGF AHXCRIiASRTD I CEEXXLD PGQ L VLRNL 
KS P I ?LWGKRSNQS TF SRPKGHLSEXMTLD LKDZND AG2JL ITFPKS SLAVGD FNLG SNV 
SLEKIXQDQKLIEENEXLKTEKDALLESY^ 

TLXTYIDTRMKNIJ^iQffi ILXEMRKVD 1 SVRFGKDLSDAIQVLDEGCFTT PASIiNGLE I 
IWA£T f SLAQENI KTCZYVSEGNILIAQR^^E^QQKL YHSVEDF ILZVDE S SI^IKRLKTLQ 
DLSVSLEAVYGQAKHGANSDEILKKFYI)WKCDKH£EF 
KVFDLSVEGSLISEDAilDNIDEILEX^^ 
Figure 76b

kihseerliatvqakykds iefkkqlisylxkipsvdhlls ixktlkslkallrwklve 
ksnleesddpdgsqiskik2eitqlrnnvfqeiykereeyemltslaqkwfpelpllhp 
eigllxymjsgglltmslerdlldaepmkelsskrpl vrsevngq i illkgy svdvdte 
akvxeraatyhrawreajegdsgllpliflflcksdpmaylw 
nseetlxvmcgvaqglhtlhkadiihgslh^^ 

nmmvgdlsi^pelkmgkpaspgsdlyaygclllwls^/qnqefeinkdgipkvdqfhld 

DKVKSLLCSLICYllSSMTAEQ^/LNAECFLMPKEQSVPNPEKLTEYTLYKKEEEIKTENL 
DKCMEKPRNGEANFDC 

SEQ ID NO.: 77 hSPG64 cDNA sequence Figure 77

gagggcaccggtgctttgttctgtctgaggccaggaagtttgaccgcgctgccATGCCG 

AACCGTAAGGCCAGCCGGAATGCTTACTATTTCTTCGTGCAGGAGAAGATCCCCGAACT 

ACGGCGACGAGGCCTGCCTGTGGCTCGCGTTGCTGATGCC^TCCCTTACTGCTCCTCA-G 

ACTGGGCGCTTCTGAGGGAGGAAGAAAAGGAGAAATACGC^GAAATGGCTCGAGAATGG 

AGGGCCGCTCAGGGAAAGGACCCTGGGCCCTCAGAGAAGCAGAAACCTGTTTTCACACC 

ACTGAGGAGGCCAGGCATGCTTGTACCAAAGCAGAATGTTTCACCTCCAGATATGTCAG 

CTTTGTCTTTAAAAGGTGATCAAGCTCTCCTTGGAGGCATTTTTTATTTTTTGAACATT 

TTTAGCa^TGGCGAGCTACCTCCTCATTGTGAACAGCGCTTCCTCCCTTGTGAAATTGG 

CTGTGTTAAGTATTCTCTCCAAGAAGGTATTATGGCAGATTTCCACAGTTTTATAAATC 

CTGGTGAAATTCCACGAGGATTTCGATTTCATTGTCAGGCTGCAAGTGATTCTAGTCAC 

AAGATTCCTATTTCAAATTTTGAACGTGGGCATAACCAAGCAACTGTGTTACAAAACCT 

- , — , 0 « « rirrv*r'nn*nnT> , &'T<( m "Ti^H , Vf2rn AOTCTGATG 




ATAGAACCAGAGTCAAL. 1 WlVll 'iWJiW-ii.'uw-jiftnvswwii v---^^ ~ — - 

GATCTACAACTTCTCACTGTAGAGGACCTTGTAGTGGGGATCTACCAACAAAAATTTCT 
CAAGGAGCCCTCTAAGACTTGGATTCGAAGCCTCCTAGATGTGGCCATGTGGGATTATT 
CTAGCAACACAAGGTGCAAGTGGCATGAAGAAAATGATATTCTCTTCTGTGCTTTAGCT 
' GTTTGCAAGAAGATTGCGJ'ACTGCATCAGTAATTCTCTGGCCACTCTCTTTGGAATCCA- 
GCTCACAGAGGCTCATGTACCACTACAAGATTATGAGGCCAGCAATAGTGTGACACCCA 
AAATGGTTGTATTGGATGCAGGGCGTTACCAGAAGCTAAGGGTTGGGAGTTCAGGATTC 
TCTCATTTCAACTCTTCTAATGAGGAACAAAGATCAAACACACCCATTGGTGACTACCC 
ATCTAGGGCAAAAATTTCTGGCCAAAACAGCAGCGTTCGGGGAAGAGGAATTACCCGCT 
TACTAGAGAGCATTTCCAATTCTTCCAGCAATATCCACAAATTCTCCAACTGTGACACT 
TCACTCTCACCTTACATGTCCCAAAAAGATGGATACAAATCTTTCTCTTCCTTATCTTA 
Atgatggtactcttttcaatttctgaaaacagtaacaggcccaacttccttcttactac 

agtcatattaaacagatcaca tcaa tgacaaatgtcac tac cataaaaac tac t taac t 
tgtaaggaaattgttccatagatttaaaaaaattgtggttggagagcatcttggcattt 
gtgct'ttttttcttgagggactgttctgcttcctggctgtatgatgggtatatcatcaa 
agtttggagtcctacatgaacaaaactgacatttttagagttgtacttctgggaatgtt 
atagattgaccactcttccccctgataataaaggtactgaacatctgctatgaaaggcc 

aaaaa 

Figure 78a 

SEQ ID NO.: 78 hSPG64 encoded protein sequence

MPNRKASRNAYYFFVQEKI ? ELRRRGL P VARVAD AI P YC S S DWALLRSEEKEKYAEMAR 
EWRAAQGKDPGPSEKQKPVFTPLRRPGMLVPKQNVSPPDMSALSLKGDQALLGGIFYFL 
NIFSHGELPPHCEQRFLPCSIGCVKYSLQEGIMADFHSFINPGEIPRGFRFKCQAASDS 

SHKI" 0 I SNFSRGENQ ATVLQNL YRF 1HPNPGUWP P IYCKSDDRTRVNWCLXHMAKASE x 
RQDLQLLTVFJDLWGIYQQKFLKSPSKTWIRSLI^VAMWDYSSNTRCKWHEENDILFCA 
LAVCKKI AYC I SNS LATLFG IQLTEAHVPLQDYEASNS WPKj^"VLDAGRYQKLRVGS S 
GFSHFNSSNEEQRSNTPIGDYPSRAKISGQNSSVRGRGITRLLESISNSSSNIHKFSNC
DTSLSPYMSQKDGYKSFSSLS

Figure 79a 

SEQ ID NO.: 79 hSPG85 cDNA sequence 

GCTTCCGAAACCTTACTATGATATTGTTAAGTCAG^^ 

GAACTATGAACCTTCAJ^GATATCCGGTATATTCTGAAGAATGACTTAAAGGATTTTACT 
GGAGCCCAGAGAACTCAACCAACCGAGAGCCCC^GAGTGCAGAGATACGGACTCCATCC 
CGATGTCAA.TGTCTATCTAGGACTGACTTCL\GAACACCCC^GAGAGACACCTGACATGG 
AAATCATAGAACTAAAGGAAATGGGCAGTCAACCTCATTCACCAAGGGTTCACTCTTTA 
TTCACTGAGGGGACACTAGATCCTC^GGCCCCAGATCCATGTCTGATGGCCAGGGAGAC 
TCAGAATCAAGATGCTCCTTGCCCTGCTCCATTTATGGO.GAAGAGGCCAGCAGCGCCA 
GC\C\GGTCAGCCAAGCCTCTGCAGTTTCGAAATCAACGAGATCTACTCAGGCTGCTTG 
ATTTTGGAAGATGACATAGAAGAGCCTCCAGGAGCTGCTTCATCTTTGGAGGCAGACGG 
ACCTAACC\(MTAGATGAACTGAAATCCATGGAAGAAGAC^TGGATAAGATGGAGAGAG 
AGGCGTGTTGTTTTGGCAGTGAGGATGAGAGCTCTTCAAAAGCTGAGACAGAGTACTCT 
TTTGATGACTGGGACTGGCAAAACGGTTCACTCAGTTCACTCAGCCTTCCTGAGTCAAC 
CAGAGAAGCC^GAGCAATTTGAACAACATGTCCACGACTGAGGAGTATCTCATCAGTA 
AGTGTGTGCTGGATCTAAAGATTATGCAGACAATAATGCACGAGAATGATGATAGGCTG 
AGGAATATCGAGCAGATATTAGATGAAGTCGAGATGAAACAGAAGGAACAGGAAGAGCG 
CATGTCTTTATGGGCCACTTCAAGAGAGTTTACAAATGCCTACAAGTTACCTCTGGCCG 
TGGGCCCTCCATCTTTAAACTATATTCCTCCTGTCCTACAGCTTTCAGGGGGTCAGAAG 
CCAGACACCAGTGGCAACTACCCA^CCCTACCAAGATTTCCA^GAATGCTGCCGA 
TTGTGACCCTGGAAAAC^GAACACAGATGAACAATTTCAGTGCACTCAAGGAGCCAAGG 
ACAGTTTGGAAACAAGCAGGATCCAAAATACCAGTAGCCAGGGAAGACCTAGAGAGTCC 
ACTGCCCAAGCCAAAGCCACACAGTTTAATAGTGCACTCTTCACTCTGTCAAGCCACCG 
GCAGGGACCTTCTGCATCACCCAGCTGTCACTGGGACTCTACCAGGATGAGTGTGGAAC 
IZTGTTTCTTCTGAAATCTAT^ 
TTAAAATGGAAAATGGAGGTGAAAGAAATGGCA^^ 

agtacctccttggcatcctcagagtagtctgactttagagagcgaggctgaaaatgagc 

ccgacgccctgctgovgccccccattaggagcccagaaaacacggattggcagcgagtt 

attgagtatcatagggaaaatgatgagccca.gaggaaatggcaagtttgacaagacggg 

caaca^tgactgtgacagtgacc^gcatggcagacagcccaggcttggaagcttcacca 

gtatcaggcacccatctcccagacaaaaggagcaaccagac<:atagtgaagccttcc^ 

gcaagttctgacacattggtggctgtagagaaatcttacagtacctcgagtcccataga 

agaggactttgaaggaatacaaggtgcatttgcccaj^cctcaagtctctggtgaggaaa 

agttco^tgagaaaaattcttggaj^^^^ 

caacctgtacgaagtactgaai3atgaac 

actgaaagagaaagacatatcattgacggatattcaagacctgtctagtatctcctatg 
aacc^gacagctcttttaaggaagcttc^^^ 

ACCAGTGTCAGCACTCCACTCAGCCCAGGGTCCGTTTCTTCAGCTGCCAGTCAGTATAA 

AGACTGCCTTGAAAGTATCAC^TTTCAGGTTAAGACAGAGTTTGCCTCTTGCTGGAACA 

GTCAAGAATTTATTCAA^-CTTTGTCTGATGACTTTATAA-GTGTCCGAGAGAGAGCAAA^ 

AAACTGGATTCTCTCCTTACTTCCTCTGAAACTCCCCCTTCAAGACTGACTGGTCTTAA 

AAGATTGTCTTCATTTATTGGGGCTGGATCCCCCAGCCTTGTTAAGGCATGTGACTCAT 

CACCACCCCATGCCACCC\GAGAJ\GGAGCCTGCCTAAAGTAGAAGCCTTCTCTCAGCAT 

CGCATTGATGAGCTGCCACCACCATCTCAGGAGCTACTTGATGACATTGAGCTCTTGAA 

ACAGCAGCAGGGCTCATCCACGGTGTTGCATGAGAACACAGCAAGTGATGGAGGAGGCA 

CTGCAAATGATCAAkGGCACTTAGAAGAACAJ^GA^ 
Figure 79b 

AGTATGCTTTTGTCCAAAGAAACTGAAGATCTTGGAGAGGACACAGAGAGAGCTCACTC 

TACTCTGGATGAGGACCTGGAAAGATGGCTGCAGCCACCTGAGGAGAGCGTGGAGCTAC 

AAGACCTTCCCMGGGCTCTGAAAGGGAGACAAATATCAAAGATCAAJU^GTTGGTGAA 

GAGAAAAGAAAJIAGGGAAGATAGCATTACACCAGAGAGAAGGAAATCAGAGGGTGTTCT 

AGGGACTTCTGAAGAJ^GATGAACTAJ^-ATCCTGTTTTTGGAAGCGACTAGGTTGGTCCG 

AATCATCCAGGATAATCGTGCTGGATCAGAGTGACTTGTCAGACTGATTGGAATTGGAT 

CATAGACGGACTCCTGGCCTGAGTTTGAGTGTCCTGGTTGTAAGCTCCTTTCTTCTCTT 

TCTGCTTCAGTTGCTGTCAGGGCAGCAGTTCCAGTTCTGTAAGTCTCACTTTGTTCAGC 

TGCCLACLAATAGACATCATCGTTTGGCCCTCTCTGTTAGCAGCACATTCAACCATTTGTT 

TTCAGTCAGATTTCTGAAAA.GTGAGAGGTAGTTTTGATAGTAAAAATTTTTGGTTGTGC 

CTAGAATGGCTTTGGTTTTGTTGATGTTAATT^ 

ATAAAATGTTTAATTTTAATAACAGAAAAA 

Figure 80 

SEQ ID NO.: 80 hSPG85 encoded protein sequence 

MNLQDIRYILKNDLKDFTGAQRTQPTESPRVQRYGLK^ 

IELKEMGSQPHSPRVHSLFTEGTLDPQAPDPCLMARETQNQDAPCPAPFMAEEASSPST 
GQPSLCSFEINEIYSGCLILEDDIEEPPGAASSLEADGPNQVDELKSMEEELDKMEREA 
CCFGSEDESSSKAETSYSFDDWDWQNGSLSSLSL^ 
VLDLKIMQTIMHENDDRLRNI^ 

PPSLNYIPPVLQLSGGQKPDTSGNYPTLPRFPRJ^PT^ 

LETSRIQNTSSQGRPRESTAQAKATQFNSALFTLSSHRQGPSASPSCHWDSTRMSVEPV 
SSEIYNAJISRNKDDGKVHLXWKM^^ 

ALLQPPIRSPENTDWQRVIEYHRENDEPRGNGKFDKTGNNDCDSDQH^ 

RHPSPRQKSQPEHSEAFQASSDTLVAVEKSYSTSSPIEEDFEG1QGAFAQPQVSGEEXF 

QMIKILGKNA£ILPRSQFQPVRSTEDEQEETSKESPKELKEKDISLTDIQDLSSISYEP 

- -DSSFKEASeKTPK-IiSIHAPTSVSTPLSPGSVSSA^ 

EFIQTLSDDFISVRERAIGCLDSLLTSSETPPSRLTGLiGElLSSFIGAGSPSLVKACDSSP 

PHATQRRSLPKVEAFSQHRIDELPPPSQELI^ 

NDQRHLEEQETDSKKEDSSMLLSKETEDLGEDTERAHSTLDEDLERWLQPPEESVELQD 
LPKGSERETNIKDQKVGESKRKREDS1TPERRKSEGVLGTSEEDELXSCFWRLGWSES 

SRIIVLDQSDLSD . 

SEQ ID NO.: 81 hSPG13 long transcript cDNA sequence Figure 81a 

actgttagtccaagctgaattccgggcgATGGCGGCAGAGGCTTCGAAGACTGGGCCTT 

CTAGGTCTTCCTACCAGCGAATGGGGAGGAAGAGTCA.GCCCTGGGGTGCCGCTGAAATC 

CAGTGCACCAGGTGTGGAJ^GGAGGGTATCC\GA.TCATCCGGTCACCA.TTGTGAACTTCA 

ATGTGGACATGCTTTTTGTGAACTATGCTTGTTAATGACTGAAGAATGCACCACAATTA 

TATGCCCTGA.TTGTGAGGTTGCTACAGCTGTAAATACTAGACAACGCTACTACCCAA.TG 

GCTGGATATATTAAGGAAGACTCCATAATGGAAAA^TGCAGCCTA^ 

CCTCCACAGACAAGACTCTTTTGAACTCATCAGCTGTAATGTTGGACACTAATACTGCA 

GAAGAAJVTTGATGAJIGCATTGAATACAGCAC^^ 
TGGAAAAGCACTTGAACACATGCAGAAGC^CGATAGAGC^ 
AAGTTGTGGAGAJ^CAGTTTGACCAJ^CTTTTGGCTTTTTTTGAOT 
CTGTGTGAJVGAAJTTTGCAJ^3AACTA^ 

AAGCTACATTGAAGAGAAJ^AAAAATAATTTGAATGCAGCTATGA^ 
TACAJ\TTATCGCCTTCTCTAAGAACATACTGTGACCTGAATCAGATTATCCGGACTTTG 

CAGTTAACTTCAGATAGTGAATTAGCAC^AGTTAGTTCTCCACAACTAAGGAACCCTCC 
Figure 81b 

CAGGTTGAGTGTGAATTGGAGTGAGATCATCTGTATC 

AATTTAGGGAC TCAACAAAATGTT ATC C CC AAG AAAATGAAATTAG ACAG AATGTTCAA 

AAGAAATATAATAACAAAAAGGAA.CTTTCTTGTTACGATACATACCCACCGCTAGAAAA 

GAAAAAGGTTGACATGTCTGTCCTAACCAGTGAAGCACCACCACCATCTTTGCAACCTG 

AGACAAATGATGTACATTTAGAJ^GCAAAJLAACTTCCAGCCACAGAAAGAC 

GCATCCCCTAAAJVCC^TTGCTGTGTTACCTCAGATGGGATCTAGCCCTGATGTGATAAT 

TGAAGAAATTATTGAAGACAACGTGGAAAGTTCTGCAGAGCTAGTTTTTGTAA.GCCATC 

TAATAGATCCTTGCCATTTCTAGATTCGGAAGTATTC 

CTGGAGAAGAAGGTGAATGAATTTTGCAATAGGAGTTCACACCTTGATCCTTCAGACAT 
TTTGGAACTAGGTGCAAGAATATTTGTCAGCAGTATTAAAAATGGAATGTGGTGTCGAG 
GAACTATCACAGAATT AATTC C AATAGAGGGTAG AAATACC AGAAAACC TTGTAGTC C A 
• ACCAGATTATTTGTCCATGAAGTTGCACTAATACAAATATTCATGGTAGATTTTGGAAA 
TTCTGAAGTCCTGATTGTCACTGGAGTTGTTGATACCCATGTGAGACCAGAACASTCTG 
CTAAGCAACLATATTGCACTAAATGATTTATGTC^ 

ACTGAAGGGCTGCTAAAAGACATCCAGCCATTAGCACAACCATGCTCATTGAAAGACAT 

TGTTCCACAGAATTCAAATGAJIGGCTGGGAAGAGGAAGCTAAAGTGGAATTT^ 

TGGTAAATAACAAGGCTGTTTCAATGAAAGTTTTTAGAGAAGAAGATGGTGTGCTTATT 

GTAGATCTGCAAAAACCACCACCGAATAAAATAAGCAGTGATATGCCTGTGTCTCTTAG 

AGATGCGCTAGTTTTTATGGAACTAGCAAAAGATCTGATCTAA t aaag t 1 ggt t gaga c 

actttctcattttttcaatgtttctgtattggaaga'agaacttaaagcttcctaatcta 

ttttgttggcgtcattcctctgctgaatttttaaatgttcactctggcttacccgttaa 

tggaagaatttgcataatatctacttagaaagatagtgggccccggac 

SEQ ID NO.: 82 hSPG13 long transcript encoded protein sequence Figure 82 

MAAEASKTGPSRSSYQRMGRKSQPWGAAEIQCTRCGRHVSRSSGHHCSIjQCGKAFCELC 

- ' — luotescttiicpdcev:atavn^ 

dqlttglhrsastdktli^ssavi^totaee 
qtieeref^iewekqfdqllaf 

LNAAMNIARALQLSPSLRTYCDLN^ 
ICMFNNMGKIEFRDSTKCYPQEN^^ 

SEAPPPSLQPETOTVHLEAKNFQPQKDVATASPKTIAVLPQMGSSPD 
SSAELVFVSHVIDPCHFYIRKYSQIKDAKVT^^ 

S S IKNGMWCRGTI TELI ? IZGRNTRKPC S PTRLFVHEVAL I Q IFMVDFGNS EVL IVTGV 
VDTHVRPEHSAZQHIAXJSTOLCLVLP^ 

EEEAKVEFLKMVNNKAVSMKVFREEDGVLIVDLQK? P PNKI S SDMPVSLRDAIiVFMELA 
KDLI,! 
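
The correspondence between the cDNA of Figure 81a and the protein of Figure 82 can be checked by translating the open reading frame with the standard genetic code. The sketch below is illustrative only. It assumes that upper-case letters in the figure mark the coding region (the lower-case run before ATG being untranslated sequence), and the name `translate_uppercase_orf` is hypothetical. Only the first line of Figure 81a is translated, because later lines contain characters that did not reproduce cleanly.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_uppercase_orf(cdna: str) -> str:
    """Translate the upper-case portion of a cDNA string, codon by codon.

    Assumption: upper case marks coding sequence, lower case marks UTR.
    Translation stops at the first stop codon or at the last complete codon.
    Any character other than A, C, G or T raises KeyError.
    """
    orf = "".join(ch for ch in cdna if ch.isupper())
    protein = []
    for start in range(0, len(orf) - 2, 3):
        residue = CODON_TABLE[orf[start:start + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the hSPG13 long transcript cDNA (Figure 81a).
first_line = "actgttagtccaagctgaattccgggcgATGGCGGCAGAGGCTTCGAAGACTGGGCCTT"
print(translate_uppercase_orf(first_line))  # MAAEASKTGP
```

The ten residues obtained, MAAEASKTGP, match the start of the protein sequence in Figure 82.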

Figure 83a 

SEQ ID NO. : 83 hSPG13 short transcript cDNA sequence 

ac tgt tagt cc aagc tgaa c t c cgggcgATGGCGGCAGAGGCTTCGAAGACTGGGCCTT 

CTAGGTCTTCCTACC^GCGAATGGGGAGGAAGAGTCAGCC CTGGGGTGC CGCTGAAATC 

CAGTGCACCAGGTGTGGAAGGAGGGTATCC\GATCATCCGGTCACCATTGTGAACTTCA 

ATGTGGACATGCTTTTTGTGAACTATGCTTGTTAATGACTGAAGAATGCACCACAATTA 

TATGCCCTGATTGTGAGGTTGC TACAGCTGTAAAT AC TAGAC AAC GCTAC TACC CAATG 

GCTGGATATATTAAGGAAGAC TCCATAATGGAAAAACTGCAGCCTAAGAC GATAAAGAA 

TTGTTCTCAGGAC TTTAAGAAGAC TGCTGATCAGC TAACTACTGGTTTAGAACGTTCAG 

CCTCaACAGACAAGACTCTTTTGAACTCV^^^ 

GAAGAAATTGATGAAGCZATTGAATACAGCACACCATAGTTTCGAACAGTTAAGCA.TTGC 
Figure 83b 

TGGAAAAGCACTTGAACAC^^ 

AAGTTGTGGAGAAACAGTTTGACCAACTO 

CTGTGTGAAGAATTTGCAAGAACTACTGATG^ 

AAGCTAC\TTGAAGAGAJ*AAAAAATAATO 

TACAATTATCKCTTCTCTAA^^ 

CAGTTAACTTCAGATAGTGAATTAGC^CA^ 

CAGGTTGAGTGTGAATTGCAGTGAGATCATCTC 

AATTTAGGGACTCAACaj^TGTTATCCCCAAGAAAATGAAATTA 

AAGAAATATAATAAC AAAAAGG AACTTTC TTGTTAC GAT ACATAC C CACCGC TAGAAAA 

GAAAAAGGTTGACATGTCTGTCCTAACC^GTGAAGCACCACCACCTCCTTTGCAACCTG 

AGACAJ^TGATGTACATTTAGAAGCWAAACTTCCAGCCAC^GAAAGACGTTGCAACA 

GCATCCCCTAAAACCATTGCTGTGTTACCTCAGATGGGATCTAGCCCTGATGTGATAAT 

TGAAGAAATTATTGAAGACAACGTGGAAACATGCGGCACAGATGATCTTGGGGAGACAC 

CTAGAT ATCC AAAAAAGCC TCTTCAGAAAAACTCATCTGTTC C TT-TTGGATC AAAAGC A 

GATACTGTAACAACTGTCTGAgcttgatgtagtggagcttactgcatgtgtagttctgc 

agagctagtttttgtaagccatgtaatagatccttgccatttctacattcggaagtatt 

cacaaataaaagacgccaaagtactggagaagaaggtgaa 

Figure 84 

SEQ ID NO.: 84 hSPG13 short transcript encoded protein sequence 

MAAEASKTGPSRSSYQRH3RKSQPWGAAEIQCTRCGRRVSRSSGHKCELQCGHAFCELC 
LLMTEECTTI IC?DCEVATAVNTRQRYY?MAGYIKZDSIMEKLQ PKTIKNC S QDFKKTA 
DQLTTGLERSASTDKTLLNSSAVMLDTNTAEEIDEALNTAHHSFEQ 
QTIEERERVIEWEKQFDQLLAFFDSRKK^ 

LNAAMNIARALQLS PSLRTYCDLNQ IIRTLQLTSDSELAQVS S PQLRNPPRLSVNCSEI 
ICMFNNMGKIEFRDSTKCYPQEL^ 

SEAPPPPLQPETI©V^^JCNFQPQKDVATASPKTIAVL _ _ 

'TCGTDDLGETPRYPKKP^ . 

SEQ ID NO.: 85 hSPG39b encoded protein sequence Figure 85 

MALRPED P S SGFRKGNWAF I IEKMARHTKG? EF YFENI SL SWEEVEDKLRAILEDSEV 

PSEVKEACTWGSLALGVTIFAHRQGQLQNFJIVQWLQGFAKLHRS 

EMECNEATFQLQLTETSLAEVQRERDMLRWKLFHAELAPPQ^ 

TEGAGSQEKEAVAAAGAAGGKGEERYAEAGPAPAEVLQGLGGGFRQPL^ 

GAEGERSQVSTNSHVCLLWAWVHSLTGASSCPAPYLIHILIPMPFVRLLSHTQYTPFTS 

KGHRTGSNSDAFQLGGL . 

Figure 86a 

SEQ ID NO.: 86 hSPG39b genomic sequence 

TCTCTTCAGGCGTGTTAAGCAGCGGGGTTGGCCTGTACTTCCTCTGGCCCTGGCTGAAG 
AGGTGAGGCCTGGTGGGAGATGTCCTAGGGTAGGACAi^GCCGGTCAGAGGGTCATTAGG 
AGGGTCTTGTCAGAGGTGGGAGGGCGGAGAAGACAGATGAGGGGAGGGGCTAAGGAGGA 
GG AAAGAAAC CTATTGGCTGCTCCATCCAC AC AGGGCTAGTGAAACC GTTAAGC C C C TA 
GGCGATC^TGGCC TTGAGACCTGAGGACCCC^.GTAG TGGGTTC CGGCAC GGAAAC GTGG 
TGGCCTTCATC^TCGAGAAJ^JVTGGCC^GGC^C^^ 

AATATATCCTTATCCTGGGAGGAGGTGGAAGAC^GCTCAGGGCO^TCCTGGAGGAC^ 
CGAGGTGCCCAGCGAGGTCAAAGAGGCCTGC^CCTGGGGCAGCCTGGCCTTGGGTGTGC 
GCTTTGCCCACAGGCAGGGGC^GTTACAAAACCGCAGGGTGO.G 

GCCAAACTGCACAGATCAGCTGCGCTGGTCTTGGCCTCAAACCTGACGGAACTCAAGGA 
ACAGCAGGAGATGGAATGCAATGAGGCGACCTTCCAGTTGCAGCTAACCGAGACCAGCC 
TTGCGGAGGTGCAGAGAGAGCGGGACATGCTGAGATGGAAGCTCTTCCATGCCGTAAGA 
TCCCCCGAATGGTCCCTGTCCAATGCCTCTGCCCTGCCCCAACCTGTCCGGACCCCCTG 
CCCTGTCCCCAGAJ^TGTGTTTCAGCTCTGCCTACTTCTCTCCAGGAGCTGGCACCTCCC 
CAGGGACAGGGCCAGGCTACAGTGTTTCCAGGCCTGGCCACTGCCGGAGGGGATTGGAC 
AGAAGGAGCAGGTGAGCAGGAAAAGGAGGCGGTGGCTGCTGCTGGTGCTGCTGGAGGAA 
AAGGAGAGGAGAGGTATGC^GAGGCAGGGCCTGCCCCCGOVGAGGTCTTGCAGGGGCTG 
GGAGGAGGCTTCAGGCAGCCCCTCGGAGCTATTGTAGCAGGCAAATTACACCTTTGCGG 
(KC^GAGGGAGAJ^GATCTC^GGTCAG^^ 

GGGTCCACAGTCTCACTGGAGCCTCTTCCTGTCCAGCTCCCTACCTCATTCACATACTC 
ATACCCATGCCCTTTGTCCGCCTTCTCAGCCATACCCAATATACCCCCTTCACCAGCAA 
AGGTCACAGAACGGGTTCCAACTCAGATGCCTTTCAACTGGGGGGCCTCTGATGCTAGC 
CTGTGGTCAGATGTGGAGGCCCAGGGAATAGACCCTCAAGAGCCCCCAAGAGACAGGAG 
AGACTCCGAACTCCATCAGCAGAGAAGACCTCCAGTATATCGCAGGCCAGGGAACTGGG 
ACTGCCCGTGGTGTAAAGCTGTGAATTTTTCATGGAGGGAAAATTGCTTCCTCTGTGGG 
AGGCGAATCTGGCTGCAAAAGCCTC^GTAAAT 

SEQ ID NO.: 87 hSPG70 cDNA sequence Figure 87a 

GACTATATTCCTGTTAAGGGGGAAGTTTGTATTGCCAAGTACACTGTTGATCAGACCTG 

GAACAGAGCAATCATACAAAACGTTGATGTGCAGCAAAAGAAGGCACATGTCTTATA-TA 

TTGATTATGGAAATGAAGAAATAATTCCATTAAJ^CAGAATTTACCACCTCAACAGGAAC 

ATTGACTTGTTTCCTCCTTGTGCCATAAAGTGCTTTGTAGCCAATGTTATCCCAGCAGA 

AGGGAATTGGAGCAGTGATTGTATCAAAGCTACTAAACCACTGTTAATGGAGCAGTACT 

GCTCCATAAJ\GATTGTCGA(LATCTTGGAAGAGGAAGTGGTTA.CCTTTGCTGTAGAAGTT 

GAGCTGCCAAATTCAGGAAAACTTTTAGACCATGTGCTTATAGAAATGGGATATGGCOT 

GAAACCCAGTGGACAAGATTCTAAGAJVGGAAAATGCAGATC^ 

TTGGAAAAATGACAACTGAAAA 

GTGTTAACTTTGAATGTAGGTGATGAGTTTTGTGGTGTGGTTGCCCACATTCAAACACC 
AGAAGACTTCTTTTGTCAACAACTGCAAAGTGGCCGAAAGCTTGCTGAACTTCAGGCAT 
CCCTTAGCAAGTACTGTGATCAGTTGCCTCCACGCTCTGATTTTTATCCAGCCATTGGT 
GATATATGTTGTGCTCAGTTCTCAGAGGATGATCAGTGGTACCGTGCCTCTGTTTTGGC 
TTACGCTTCTGAAGAATCTGTACTGGTCGGATATGTAGATTATGGAAACTTTGAAATCC 

TTAGTTTGATGAGACTTTGTCCCAT^^ 

ATAAAGTGTGTACTAGCAGGAGTAAAGCCATCATTAGGAATTTGGACTCCA.GAAGCTAT 
TTGTCTCATGAAAAAACTTGTACAGAACAAAATAAT 

TGGAAAACAGTTCCCTGGTGGAGCTTA-TTGA.TAAATCCGAGACGCCTCATG^ 
AGCAAAGTTCTCCTAGATGCAGGCTTTGCTGTGGGA.GAACAGA.GTATGGTGACAGA.TAA 

ACCCAGTGACGTGAAAGAAACCAGTGTTCCCTTGGGTG^^^ 

TGGAGTGGA.CATGGGTTGAACTTGGTGTTGACCAAACA.GTAGATGTTGTGGTCTGTGTG 
ATATATAGTCCTGGAGAATTTTATTGCCATGTGCTTAAAGAGGATGCTTTAAAGAAACT 
C^TGATTTGAJ^CAAGTCATTAGCAGAAC^TGCCAGCAGAAG 

AGGCAGAGATAGGACAACCTTGTTGTGCTTTTTTTGCAGGTGATGGTAGTTGGTATCGT 
GCTTTAGTCAAGGAJ\ATCTTACC 2 UIATGG 

A^CATCGAJ^GAAGTTACTGCAGATGAACTCCGAATGATATCA.TCAACATTTTTAAACC 

TTCCCTTTCAGGGAJ^TACGGTGCCAGTTAGCAGATATACAGTCTAGAJ^^ 
TCTGAAGAAGCCATAACAJVGATTCCAGATGTGTGTTGCTGGGATAAAATTGCAAGCCAG 

AGTGGTTGAJ^GTCACTGAAAATGGGATAGGAGTTGAACTCACCGATCTC 
ATCCCAGAATAATTA.GTGATGTTCTGATTGA.TGAACATCTGGTTTTAAAJ^TCTGCTTCA 
Figure 87b 

CCACATAAAGACTTACCAAATGACAGACTTGTTAATAAACATGAGCTTCAAGTTCATGT 

ACAGGGACTTCAAGCTACCTCTTCAGCTGAGCAATGGAAGACGATAGAATTGCCAGTGG 

ATAAAACTATACAAGCA^TGTATTAGAAATC^TAAGCCCAAACTTGTTTTATGCTCTA 

CCAAAAGGGATGCCAGAAAATCAGGAAAAGCTGTGCATGTTGACAGCTGA^ 

ATACTGCAATGCTCCGAAAAGTCGACCACCCTATAGACCAAGAATTGGAGACGCATGCT 

GTGCCAAATACACAAGTGATGATTTTTGGTATCGTGCAGTTGTTCTGGGGACATCAGAC 

ACTGATGTGGAAGTGCTCT ATGCAGACTATGGAAAC ATTGAAACC CTGC C TCTTTGCAG 

AGTGCAACC AATC AC CTCTAGC CACCTGGCGCTTCCTTTCCAAATTATTAGATGTTCAC 

TTGAAGGATTAATGGAATTGAATGGAMCTCTTCTCAATTAATAA 

AATTTCATGTTGAAT CAGAATGTAATGC TTTCTGTGAAAGG AATT ACAA* GAATGTCCA 

TACAGTGTCAGTTCAGAAATGTTCTGAGAATGGG 

TGAC\TTTGGTCTGGCAAAAAACATCACAC^ 

AAGATGTATAGGACGAATTGCTGCTCCACAGAGTTAa^^^ 

ACATATTCTTCTCTTCCTCTTAAAG^ 

AAAAACTGGTAAAAAGTTAAGTAAGTTAAATCGTATGTTTTCGCCTCTTCTGTGATCAC 
CAATAGGACATCTTCAGGCATATTGGCAGGATAGAGCTAATGGA.GTGAAACCTATTGTA 
AGGCTGTACTTTCGTGATTTAATGACCT^ 

GTAGGTTTATCTGATCGTTTTCCTTTGCTACTGCTAATGGAACTGAACC C C CAGGGGTA 
rTCCAGTTGTAATAGCCTTTCCTTACTGTTGTTTGGTTCTGTGAJ^TGCCTATGTTATTG 

ATATGTGGAGGGC CGG AATTCTTTTGCTA 

Figure 88 

SEQ ID NO. :88 hSPG70 encoded protein sequence 

MEQYCS IKTVD ILEEEWTF AVEVEL PNSGKLLDHVXi I5KGYGLKPS GQD SKjCSNADQ S 

DPEDVGKMTTENNIVVE^^ 

EXQASLSKYCDQLPPRSDFYPAIGDICCAQFSEDDQWYRASV^ 
KTFEILSIJ^C?I IPKLLELPMQAIKCVLAGV^ 
yVDKLEN5SLVrXIDKSETP:^SV5:<VT.LDAGF 
ECVOTLEWTWVELGVDQTVDVVVCV 
PNGFKAEIGQPCCAFFAGDGSWYRALVKEI^ 

rFLNLPFQG IRCQLADI QSH^KHWSEEAIT31FQMCVAGIKLQ ARVVEVTENG IGVELTD 
LSTCYPR! I SDVL IDEHLVLKS AS PHKDLPNDRLVNKHELQVEVQGLQ ATS S AEQWKTI 
ELPVDKTIQANVLEIISPNLFYALPKG 

GDACCAICYTSDDFWYRAVVLGTSDTDVE^ PLCRVQP IT S SKLALPFQ I 

IRCSLEGLMEI^GSSSQLI^LXN^^ 

ADKLVTFGLAKNITPQRQSALOT 

FIEMKKLVKS 



WO 01/66752 



81/108 



PCT/US01/07371 



Human TEX11 cDNA sequence: Figure 89a 

TGGTTAAGTCCMGCTGACAATGATGATTTTITTTCCATGGACTTTAAAGAAG 
• TTGTTGAAAACCTGGTTACAAATGATAATTCACCTAACATACCAGAGGCAATT 
GATAGACTCTTCAGCGACATAGCAAATATCAACAGGGAGTCTATGGCTGAAA 
TAACAGACATTCAGATTGAAGAAATGGCAGTAAACCTATGGAACTGGGCACT 
TACCATAGGAGGAGGTTGGCTTGTAAATGAAGAGCAGAAAATTAGATTACAT 
TATGTTGCTTGCAAGTTGCTGAGTATGTGTGAAGCCTCATTTGCCTCAGAAC 
AAAGTATTCAAC GACTG ATTATGATGAATATGAGAATAGGAAAAGAATGGTT 
GGATGCTGGAAATTTTCTAATCGCTGATGAATGTTTTCAAGCTGCTGTGGCC 
AGTCTGGAGCAATTATACGTCAAATTAATTCAAAGGAGCTCCCCTGAGGCTG 
ACTTGACCATGGAGAAGATTACTGTTGAGAGTGACCACTTCAGAGTGCTTTC 
TTACCAAGCAGAGTCAGCAGTTGCTCAAGGGGATTTTCAAAGAGCATCTATG 
TGTGTACTGCAATGTAAAGATATGTTGATGAGGCTCCCCCAGATGACTTCAA 
GTCTTCATCATCTCTGTTACAACTTTGGAGTAGAAACCCAGAAGAATAATAAA 
TATGAAGAAAGTTCTTTCTGGCTTAGCCAAAGCTATGATATTGGGAAGATGG 
ATAAGAAATCTACTGGGCCAGAAATGCTGGCTAAAGTTCTACGGCTATTAGC 
CACGAATTATTTGGATTGGGATGACACCAAATATTATGATAAGGCTCTCAAT 
GCTGTAAACCTAGCAAACAAGGAACATTTAAGTTCTCCTGGGCTTTTCTTAA 
AAATG AAAATCCTCTTG AAAGG CG AAAC ATCTAATG AAG AACTC CTTG AAGC 
TGTCATGGAAATACTACATCTTGACATGCCCTTAGACTTCTGTCTGAACATT 
GCTAAACTGCTGATG GATCA TGAAAGAGAATCTGTTGGGTTTCATTTCCTGA 
CGATTATTCATGMCG TTTTA AGTCATCGGAAAATATTGGAAAAGTTCTGATA 
CTCCATACTG ACATGCTTTTAC AAAG GAAG G AAG AACTTCTTGCC AAG GAGA 
AGATTGAAGAAATC I I I I I AGCTCACCAAACAGGAAGACAACTGACAGCAGA 
ATCAATGAACTGGTTACACAACATTCTGTGGAGACAAGCTGCCAGTAGTTTT 
_ GAGGTACAAAATOCACXGATGCCCTACAATGGTACTATTATTCTCTGAGGT 
TTTATTCAACTGATGAAATGGATCTGGACTTCACCAAGCTGCAGAGGAACAT 
GGCTTGCTGTTACCTGAATTTGCAACAACTTGATAAGGCCAAAGAGGCAGT 
GGCAGAAGCTGAACGACATGACCCTAGGAACGTnTCACTCAATTTTATATA 
TTCAAGATTGCAGTCATAGAGGGCAACTCTGAAAGAGCTTTGCAGGCAATAA 
TTACTTTAGAGAATATATTAACAGATGAAGAGTCAGAAGATAATGATCTAGTT 
GCAGAGAGAGGTTCACCTACCATGCTTCTAAGTTTAGCTGCCCAGTTTGCTC 
TAGAGAATGGACAACAAATTGTGGCAGAAAAAGCTTTGGAATATTTAGCTCA 
ACATTCAGAAGACCAGGAACAAGTTCTTACAGCTGTAAAGTGTTTGCTTCGT 
TTTCTTCTTCCAAAAATTGCTGAAATGCCGGAATCTGAAGATAAGAAGAAAG 
AM TGGA TCGACTTTTGACTTGCCTGAATAGAGCCTTTGTGAAACTTTCTCA 
GCCTTTTGGTGAAGAAGCCTTAAGTTTGGAGTCAAGAGCTAATGMGCTCA 
GTGGTTTCGAAAAACAGCTTGGAACTTGGCTGTGCAATGTGACAAAGATCC 
AGTGATGATGAGAGAGTTTTTTATACTTTCTTATMGATGTCCCAGTTTTGTC 
CTTCTGATCMGTAATTCTGATTGCACGGAAAACATGTTTACTTATGGCAGTT 
G CAGTTGATCTAG AGCAAGG G AG AAAAG CTTCAACAGCTTTTGAAC AG ACC 
ATGTTCCTGAGTCGTGCACTTGAGGAGATCCAGACATGCAATGACATCCATA 
ATTTCCTGAAACAAACAGGGACCTTCTCAAATGATTCATGTGAGAAATTGCT 
TCTGCTGTACGAGTTTGAAGTTAGAGCCAAATTGAATGATCCATTACTGGAA 
AGCTTCCTGGAATCAGTGTGGGAGTTGCCTCATTTAGAAACTAAAACATTTG 
AAACAATTGCAATAATAGCAATGGAAAAGCCTGCACACTATCCTTTGATTGC 

TCTCAAGGCCTTGAAAAAGGCTTTATTGCTCTACAAAAAGGAAGAACCAATT 

GATATATCACAATACAGCAAATGTATGCACAACTTGGTTAACCTCTCAGTGC 

CAGATGGGGCGTCGAATGTAGAGCTCTGTCCCCTGGAAGAAGTTTGGGGC 

TATTTTGAAGATGCTCTGAGCCACATTAGCCGCACTAAAGACTACCCAGAAA 

TGGAGATTCTCTGGCTGATGGTCAAGTCCTGGAATACCGGAGTACTTATGTT 

TAGCAGGAGCAAGTATGCATCTGCTGAAAAGTGGTGTGGCCTGGCCTTGCG 

TTTCCTTAACCACCTTACCTCCTTCAAGGAAAGCTATGAAACTCAGATG AATA 

TGCTGTATAGTCAGCTTGTGGAAGCATTGAGTAACAACAAGGGCCCAGTTTr 

TCATGAACATGGCTACTGGAGCAAGTCAGATTAGGCAAGCTCATGGCCACA 

TGMGMGATACATTGTCCCGAGATGCTGACTGTTTAAATTTTTGCCAGAGT 

TTCTCTTGAGCTTTTGTTTCTGTTTGCTCAGACCCTGT7TTCATGTTGTTGAA 

TAAACTTTCTAA^ATAAAAG C ATGCTG AATTT 



Figure 89b 
Human TEX11 protein sequence: 

MDFKEWENLVTNDNSPNIPEAIDRLFSDIANINRESMAEITDIQIEEMAVNLWN 

WALTIGGGWLVNEEQKIRLHYVACKLLSMCEASFASEQSIQRLIMMNMRIGKE 

WLDAGNFLIADECFQAAVASLEQLYVKLIQRSSPEADLTMEKITVESDHFRVUSY 

QAESAVAQGDFQRASMCVLQCKDMLMRLPQMTSSLHHLCYNFGVETQKNNK 

YEESSFWLSQSYDIGKMDKKSTGPEMLAKVLRLLATNYLDWDDTKYYDKALNA 

VNLANKEHLSSPGLFLKMKILLKGETSNEELLEAVMEILHLDMPLDFCLNIAKLLM 

DHERESVGFHFLTIIHERFKSSENIGKVLILHTDMLLQRKEELLAKEKIEEIFLAHQ 

TGRQLTAESMNWLHNILWRQAASSFEVQNYTDALQWYYYSLRFYSTDEMDLD 

FTKLQRNMACCYLNLQQLDKAKEAVAEAERHDPRNVFTQFYIFKIAVIEGNSER 

ALQAJITLENILTDEESEDNDLVAERGSPTMLLSLAAQFALENGQQIVAEKALEYL 

AQHSEDQEQVLTAVKCLLRFLLPKIAEMPESEDKKKEMDRLLTCLNRAFVKLSQ 

PFGEEALSLESRANEAQWFRKTAWNLAVQCDKDPVMMREFFILSYKMSQFCP 

SDQVILIARKTCLLMAVAVDLEQGRKASTAFEQTMFLSRALEEIQTCNDIHNFLK 

QTGTFSNDSCEKLLLLYEFEVRAKLNDPLLESFLESVWELPHLETKTFETIAJIAM 

EKPAHYPLIALKALKKALLLYKKEEPIDISQYSKCMHNLVNLSVPDGASNVELCPL 

EEVWGYFEDALSHISRTKDYPEMEILWLMVKSWNTGVLMFSRSKYASAEKWC 

GLALRFLNHLTSFKESYETQMNMLYSQLVEALSNNKGPVFHEHGYWSKSD 



Figure 90 
Identification of spermatogonia-specific genes 
by cDNA subtraction 

Spermatogonia cDNAs/RsaI 

cDNAs/RsaI of heart, brain, lung, liver, skeletal muscle, kidney, 
spleen, stomach, thymus, skin, W/Wv testis 
(in great excess) 

cDNA subtraction 
Subtracted cDNAs 

2300 clones sequenced 

60% differentially expressed or ubiquitous 
40% germ cell specific 

23 novel genes | 13 known genes 


Figure 91 
Known germ cell-specific genes enriched 
by subtraction 

Gene         Chr  Source                  Significance
Rbmy         Y    Elliott, 1996           implicated in male fertility
Dazl         17   Reijo, 1996             implicated in male fertility
Ube1y        Y    Mitchell, 1991          spermatogonial proliferation
Usp9y        Y    Ehrmann, 1998           implicated in male fertility
Sycp1        3    Sage, 1997              meiosis
Sycp2        2    Wang, unpublished       meiosis
Sycp3        10   Klink, 1997             meiosis
Figla        6    Liang, 1997             bHLH transcription factor
Ddx4         13   Fujiwara, 1994          germ cell determination in fly
Tuba3/Tuba7  6    Villasante, 1986        testis specific tubulin isoform
Ott          X    Kerr, 1996              meiosis
Mage         X    De Plaen, 1999          melanoma associated antigen
Stra8        6    Oulad-Abdelghani, 1996  induced by retinoic acid

The subtraction is highly sensitive and comprehensive 



Figure 92 
23 novel mouse germ cell specific genes 



Figure 93 



SUBSTITUTE SHEET (RULE 26) 



Novel mouse germ cell specific genes 

Gene      Significance
Taf2q     Transcription initiation factor
Nxf2      Nuclear mRNA export factor
Rnf17     RING finger protein interacting with all Mad members of the Myc oncoprotein pathway
Mov10l1   Putative RNA helicase
Piwil2    Homologue of piwi involved in germ cell renewal in fly
Tktl1     Transketolase
Usp26     Ubiquitin specific protease
Fthl17    Ferritin heavy chain; iron metabolism
Stk31     Putative protein kinase with one tudor domain
Rnh2      Ribonuclease inhibitor
Tdrd1     Four tudor domains
Tex14     Putative protein kinase
Pramel1   Prame-like gene
10 genes  No homology with proteins in the database



Figure 94 
Post-transcriptional gene regulation 
of germ cell development 

Genes    Features
Nxf2     Nuclear mRNA exporter (RRM)
Rnh2     Ribonuclease inhibitor (LRR)
Stk31    One tudor domain
Tdrd1    Four tudor domains
Mov10l1  RNA helicase
Dazl     RNA recognition motif (RRM)
Rbm      RNA recognition motif (RRM)
Ddx4     DEAD box; a putative RNA helicase



Figure 95 
Abundance of male germ-cell-specific 
genes on X Chromosome 

[Chromosome map: chromosomes 1 to 19, X and Y, with germ cell-specific genes marked at their loci. Legible gene labels: Sycp1, Tex20, Sycp2, Ube1y, Stra8, Stk31, Figla, Tex15, Tuba7, Pramel1, Ddx4, Tex14, Mov10l1, Tex19, Tdrd1, Usp26, Tktl1, Rbmy, Mage.] 

23 genes are novel 
25 genes are male-specific 
11 genes are expressed in both sexes 

Figure 96 
Rapid evolution of spermatogonia-specific genes 
in mouse and human 

[Bar chart of substitution rates, vertical axis marked 0.1 to 0.5. Legend: synonymous substitution rate (mutation rate, Ks); non-synonymous substitution rate (evolution rate, Ka). Legible category labels: Ubiquitous, Tissue; legible counts: 103, 949.] 
Hybrid male sterility in mice 

Locus                   Hst-1                              Hst-3
Cross                   M. m. musculus x M. m. domesticus  M. spretus x M. m. domesticus
Separation              1 million yrs                      3 million yrs
Male sterility          Yes                                Yes
Mapping                 Chr. 17 t-complex                  Chr. X distal end
Pathology               meiotic arrest                     meiotic arrest
X-Y dissociation        High                               High/Low
Autosomal dissociation  High                               High/Low
Nature of defect        Genic                              Structural ?
Candidate genes for Hst-3 

Mouse X Chromosome: Fthl17, Usp26, Tktl1, Tex11, Tex16 



Figure 99 
14 novel human testis-specific genes 
BAC physical map and gene structure of TEX11 

[Diagram: BAC contig with exons 1 to 29 marked. Legend: sequenced in house; sequenced by Human Genome Project.] 

The TEX11 gene is ~400 kb and consists of 29 exons. 



Figure 101 
High throughput mutation screening by genomic sequencing 

[Diagram: an exon flanked on each side by intron sequence.] 

PCR amplification on infertile patient DNA 
Sequencing of PCR product 
Sequence analysis 

380 infertile males and 93 fathers 
29 exons of TEX11 

14,000 PCR reactions 
14,000 sequencing reactions 



Figure 102 
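
The reaction counts in Figure 102 are consistent with one amplicon per exon per subject. A minimal sketch of that arithmetic follows. It assumes, which the figure does not state, exactly one PCR and one sequencing reaction for each exon in each man; the function name `reactions_needed` is illustrative.

```python
def reactions_needed(subjects: int, exons: int) -> int:
    """Number of reactions if every exon is amplified once in every subject."""
    return subjects * exons


infertile_males = 380   # from Figure 102
fathers = 93            # fertile controls, from Figure 102
tex11_exons = 29        # from Figure 102

total = reactions_needed(infertile_males + fathers, tex11_exons)
print(total)  # 13717, which the figure rounds to about 14,000
```

The same count applies to the sequencing step, since each PCR product is sequenced.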
Mutations found in infertile but 
not normal males 

Figure 103 

                   3' of intron  Exon  5' of intron
Average size (bp)  65            103   88
Epigenetic down-regulation of X-linked genes 
during male meiosis 

[Expression panel. Stage groups: Mitosis, Meiosis, Haploid. Lanes: PA, A, B, PL, LZ, PP, P, RT, RB. Row labels: Autosomal, X-linked, Actin, Ube1x, Fthl17.] 

Figure 105 
Abundance of spermatogonia genes 
on X Chromosomes 



Origin of Species Origin of human male infertility 



Hybrid sterility X-linked male infertility 



Hybrid sterility in men? 



Figure 106 
Intronic Variants in TEX11 

Patient    IVS     Variant                    Diagnosis      Found in 380 infertile  Found in 93 normal
1E03       2       T(-17)C                    AZ             1                       0
           3       A(35)G                                    CV                      57
           3       T(-22)C                    AZ, SCO, TMA   6                       0
2H4        3       CAT(-22)TAC                               1                       1
4F9        4       G(-48)A                    SCO            1                       0
           10      T(-27)C                                   CV                      5
4F12       11      T(-28)C                    TMA            1                       0
1C02       14      G(-64)A                    SCO/TMA        1                       0
           15      A(48)T                                    CV                      22
           17      ATT, AAC, GAC -23 to -25                  CV, three haplotypes    Yes
1G08       18      T(-22)C                    severe OZ      1                       0
1C6, 4G11  20*     T(-10)A                    AZ, TMA        2                       0
4B11       20*     G(-1)A                     TMA/OZ         1                       0
4G1        21      A(-34)G                    SCO            1                       0
           22      C(-44)T                    normal         0                       1
1C2        23      G(-119)A                   SCO/TMA        1                       0
4C6        26      A(-55)C                    SCO            1                       0
           27      T58C                                      12                      3
           27      TC(-4,-3)AT                               Variant                 4
2H9        27      A(-44)G                    fructose+ AZ   1                       0
           3'UTR   T(123)C                                   4                       1

Only 1 variant found in normal males 
All the variants found only in infertile males are in the 3' region of introns 
Nearly all are in the AZ, TMA, SCO. 



Figure 107 
CODING VARIANTS IN TEX11 

Patient ID     Source   Exon  Variant                   Change    Diagnosis      I 380  F 93
                        5     AAA-AGA(320) K107R        mis                      14     5
1B12 WHT3150   Oates$   5     TGG-AGG(325) W109R        mis       AZ             1      0
4B04 WHT3171   Oates$   5     C381T next to 5' SS       silent    TMA            1      0
3D12 WHT3417   Oates$   6     GTC-ATC(400) V134I        mis       AZ/OZ          1      0
3G08 WHT3500   Silber$  6     CAA-CGA(491) Q164R        mis       pathologic AZ  1      0
1H11 WHT3759   Silber$  15    Ins(1233) 2bp             nonsense  TMA            1      0
                        15    GAA-AAA(1282) Glu428Lys   mis                      20     3
2B06 WHT3677   Oates$   16    AAC(1449)AAA Asn483Lys    mis       OZ             1      0
4C04 WHT2499   Silber$  25    GTG2219GCG V740A          mis       TMA            1      0
1B07 WHT3459   Oates$   25    A(2250)T                  silent    AZ             1      0
4C06 WHT2546   Silber$  26    T2295C                    silent    SCO            1      0
                        27    T2472C                    silent                   23     4

AZ: azoospermia; OZ: oligospermia; TMA: testicular maturation arrest; SCO: Sertoli cell only 

$ = families being pursued and cell lines being further studied 
I 380 = No. in 380 infertile men 
F 93 = No. in 93 normal men 

Figure 108 
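
The missense entries in Figure 108 pair a codon change and nucleotide position with an amino-acid change, and the two can be cross-checked against the standard genetic code. The sketch below is illustrative and rests on two assumptions not stated in the figure: nucleotide positions are counted from the A of the start codon as position 1, and the number in parentheses is the position of the changed base. The function names are hypothetical.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def missense_label(ref_codon: str, alt_codon: str, position: int) -> str:
    """Build a label such as 'K107R' from a codon change and base position."""
    codon_number = (position + 2) // 3  # 1-based codon containing the base
    return f"{CODON_TABLE[ref_codon]}{codon_number}{CODON_TABLE[alt_codon]}"


def changed_base_matches(ref_codon: str, alt_codon: str, position: int) -> bool:
    """True if exactly one base differs and it sits at the stated position."""
    diffs = [i for i in range(3) if ref_codon[i] != alt_codon[i]]
    return len(diffs) == 1 and diffs[0] == (position - 1) % 3


# (reference codon, variant codon, position) as listed in Figure 108.
variants = [
    ("AAA", "AGA", 320), ("TGG", "AGG", 325), ("GTC", "ATC", 400),
    ("CAA", "CGA", 491), ("GAA", "AAA", 1282), ("AAC", "AAA", 1449),
    ("GTG", "GCG", 2219),
]
labels = [missense_label(*v) for v in variants]
print(labels)
```

Under these assumptions the labels come out as K107R, W109R, V134I, Q164R, E428K, N483K and V740A, in agreement with the figure (which gives two of them in three-letter form, Glu428Lys and Asn483Lys).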
Coding variants found in infertile but not normal males 



[Pedigree diagram. Legible annotations: No children; Unknown, in Europe; Fertile; Fertile WHT2507; WHT2506; Adopted; All fertile; first cousin fertile; AZ; mat. arrest.] 

Proband WHT2508 has a one bp deletion in TAF2Q (X-linked). 
We have his histology 

WHT is heterozygous for this mutation 

WHT2508 pedigree 
Variants in TAF2Q 

Patient ID           Source   Exon   Variant                 Change     Diagnosis  I 380              F 93
3F9                           3      T142C 4bp to SS         silent     TMA        1                  0
[illegible] WHT3493  Oates$   4      [illegible] Asp39Gly    mis        OZ         1                  0
2B3                           5      G375A                   silent     severe OZ  1                  0
                              9      AGC-GGC(664) Ser222Gly  mis                   66                 11
                              10     6bpDel                  Del(2 aa)             96                 20
1A11 WHT2508         Silber$  11     Del(A928)               nonsense   TMA        1                  0
Ctrl#1C06                     13     G1109A C370Y            missense   Normal     0                  1

3C12                          IVS2   G(-47)C                            OZ         1                  0
4E08                          IVS3   A(-24)C                            SCO        1                  0
3E05                          IVS4   A(24)C                             unknown    1                  0
2F10                          IVS7   C(-57)G                            unknown    1                  0
                              IVS8   A(52)G                                        CV                 31
1B11                          IVS9   G(9)A                              AZ         1                  0
                              IVS10  A(91)G                                        61+96 (haplotype)  10+20
                              IVS10  (-104)                                        CV                 29

I 380 = No. in 380 infertile men 
F 93 = No. in 93 normal men 



Figure 112 
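
The I 380 and F 93 columns of Figure 112 are raw counts, so comparing the two groups requires dividing by group size. The sketch below is an illustration of that conversion only. The function name is hypothetical, and no statistical test is implied. Because TAF2Q is X-linked, each man carries a single copy, so the carrier fraction equals the allele frequency within each group.

```python
def carrier_fraction(carriers: int, group_size: int) -> float:
    """Fraction of men in a group who carry the variant."""
    return carriers / group_size


INFERTILE = 380  # group size behind the I 380 column
NORMAL = 93      # group size behind the F 93 column

# Counts for the two common TAF2Q coding variants listed in Figure 112.
counts = {"Ser222Gly": (66, 11), "6bpDel": (96, 20)}

for name, (in_infertile, in_normal) in counts.items():
    f_inf = carrier_fraction(in_infertile, INFERTILE)
    f_norm = carrier_fraction(in_normal, NORMAL)
    print(f"{name}: infertile {f_inf:.1%}, normal {f_norm:.1%}")
```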
Mouse Genes 

Gene symbol  Gene name                                    Expression        Chr  GenBank no.  Comments
Fthl17       Ferritin heavy polypeptide-like 17           testis            X    AF285569     Ferritin, functioning in iron metabolism, consists of 24 heavy and light chains
Usp26        Ubiquitin specific protease 26               testis            X    AF285570     Predicted protein contains His and Cys domains conserved among deubiquitinating enzymes
Tktl1        Transketolase-like 1                         testis            X    AF285571     Homologous to human transketolase
Tex11        Testis expressed gene 11                     testis            X    AF285572     Novel 947-residue protein
Tex16        Testis expressed gene 16                     testis            X    AF285573     Novel protein (residue count illegible), rich in serine
Taf2q        TBP-associated factor RNA polymerase II, Q   testis            X    AF285574     Human autosomal homolog TAF2F encodes a component of TFIID
Pramel3      PRAME (human)-like 3                         testis            X    AY004873     Homologous to human PRAME, encoding a melanoma antigen recognized by cytotoxic T cells
Nxf2         Nuclear RNA export factor 2                  testis            X    AF285575     Homologous to Mex67p and NXF1, encoding nuclear RNA export factors
Tex13        Testis expressed gene 13                     testis            X    AF285576     Novel 186-residue protein; two closely related homologs on human X chromosome
Pramel1      PRAME (human)-like 1                         testis            4    AF285578     Homologous to human PRAME
Tex17        Testis expressed gene 17                     testis            4    AF285579     Novel 120-residue protein; calculated pI [value missing]
Stk31        Serine/threonine kinase 31                   testis            6    AF285580     Putative protein kinase with tudor domain (found in RNA-interacting proteins) and coiled coil region
Rnh2         Ribonuclease inhibitor 2                     testis            7    AF285581     Predicted protein contains 6 leucine-rich repeats
Tex12        Testis expressed gene 12                     testis            9    AF285582     Novel 123-residue protein with coiled coil region
Tex18        Testis expressed gene 18                     testis            10   AF285583     Novel 80-residue protein
Tex14        Testis expressed gene 14                     testis            11   AF285584     Predicted protein contains two protein kinase domains
Rnf17        Ring finger protein 17                       testis            14   AF285585     A RING finger-containing protein
Piwil2       piwi (drosophila)-like 2                     testis            14   AF285586     Homologous to Drosophila piwi, involved in germ-line stem cell renewal and meiotic drive
Mov10l1      Mov10 (mouse)-like 1                         testis            15   AF285587     Putative RNA helicase
Tex20        Testis expressed gene 20                     testis and ovary  1    AF285588     Novel 188-residue protein; calculated pI 10.2
Tex15        Testis expressed gene 15                     testis and ovary  8    AF285589     Novel 2785-residue protein
Tex19        Testis expressed gene 19                     testis and ovary  11   AF285590     Novel 351-residue protein with coiled coil region
Tdrd1        Tudor domain protein 1                       testis and ovary  19   AF285591     Predicted protein contains 4 tudor domains
Mouse spermatogonially expressed germ cell-specific genes and the human orthologs 

Mouse    GenBank No.  Human    GenBank No.  Chr.
Fthl17   AF285569     FTHL17   AF285592     X
Usp26    AF285570     USP26    AF285593     X
Tktl1    AF285571
Tex11    AF285572     TEX11    AF285594     X
Tex16    AF285573
Taf2q    AF285574     TAF2Q    AF285595     X
Pramel3  AY004873
Nxf2     AF285575     NXF2     AF285596*    X
Tex13    AF285576     TEX13A   AF285597     X
Pramel1  AF285578     TEX13B   AF285598     X
Tex17    AF285579
Stk31    AF285580     STK31    AF285599     7
Rnh2     AF285581
Tex12    AF285582     TEX12    AF285600     11
Tex18    AF285583
Tex14    AF285584     TEX14    AF285601     17
Rnf17    AF285585     RNF17    AF285602     13
                               AF285603
Piwil2   AF285586
Mov10l1  AF285587     MOV10L1  AF285604     22
Tex20    AF285588                           8
Tex15    AF285589     TEX15    AF285605
Tex19    AF285590                           10
Tdrd1    AF285591     TDRD1    AF285606